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10 BACKGROUND OF THE INVENTION 

Cross-Reference to Related Application 

This application in a continuation-in-part of USSN 
09/027,337, filed February 20, 1998 and thereby claims the benefit 
15 of priority under 35 USC §120. 

Field of the Invention 

The present invention relates generally to the fields of 
cellular biology and the diagnosis of neoplastic disease. More 
20 specifically, the present invention relates to an extracellular serine 



protease, termed tumor antigen-derived gene 15 (TADG-15), which is 
overexpressed in carcinomas. 

Description of the Related Art 

Extracellular proteases have been directly associated with 
tumor growth, shedding of tumor cells and invasion of target organs. 
Individual classes of proteases are involved in, but not limited to, (a) 
digestion of stroma surrounding the initial tumor area, (b) digestion 
of the cellular adhesion molecules to allow dissociation of tumor cells; 
and (c) invasion of the basement membrane for metastatic growth 
and activation of both tumor growth factors and angiogenic factors. 

In the process of cancer progression and invasion, 
proteases mediate specific proteolysis and contribute to the removal 
of extracellular matrix components surrounding tumor cells, the 
digestion of intercellular adhesion molecules to allow dissociation of 
malignant cells and the activation of many growth and angiogenic 
factors. 1 " 3 Depending on the nature of their catalytic domain, 
proteases are classified into four families: serine proteases, 
metalloproteases, aspartic proteases and cysteine proteases. 3 Among 
these proteases, the metalloproteases have been well studied in 
relation to tumor growth and progression, and they are known to be 



capable of degrading the extracellular matrix, thereby enhancing the 
invasive potential of malignant cells. 1,4 ' 5 For serine proteases, 
previous studies have demonstrated an increased production of 
plasminogen activator in tumor cells and a positive correlation 
5 between plasminogen activator activity and aggressiveness of 
cancer. 6,7 Prostate specific antigen (a serine protease) has also been 
widely used as an indicator of abnormal prostate growth. 8 More 
recently, several other serine proteases have been reported, viz. 
hepsin and the stratum corneum chymotryptic enzyme (SCCE), which 
10 are overexpressed in ovarian cancer and which may contribute to 
malignant progression by increasing the extracellular lytic activity of 

these tumor cells. 9 

The prior art is deficient in the lack of effective means of 
screening to identify proteases overexpressed in carcinoma. The 
15 present invention fulfills this longstanding need and desire in the art. 



SUMMARY OF THE INVENTION 



20 The present invention discloses a screening program to 

identify proteases overexpressed in carcinoma by examining PCR 
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products amplified using differential display in early stage tumors 
and metastatic tumors compared to that of normal tissues. The 
approach herein to identify candidate genes overexpressed in tumor 
cells has been to utilize the well conserved domains surrounding the 

5 triad of amino acids (His-Asp-Ser) prototypical of the catalytic 
domain of serine proteases. Herein, evidence is presented for a 
unique form of serine protease not previously described in the 
literature which is highly expressed in ovarian carcinomas. Through 
the screening approach using differential PCR amplification of 

10 normal, low malignant potential and overt carcinomas, a PCR product 
present only in carcinoma was subcloned and sequenced, and was 
found to have a catalytic domain which was consistent with the 
serine protease family. Reported herein is the complete cloning and 
sequencing of this transcript and evidence for its expression in 

15 ovarian tumor cells. 

In one embodiment of the present invention, there is 
provided a DNA encoding a tumor antigen-derived gene (TADG-15) 
protein, selected from the following: (a) an isolated DNA which 
encodes a TADG-15 protein; (b) an isolated DNA which hybridizes 
20 under high stringency conditions to the isolated DNA of (a) above and 
which encodes a TADG-15 protein; and (c) an isolated DNA differing 
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from the isolated DNAs of (a) and (b) above in codon sequence due to 
the degeneracy of the genetic code, and which encodes a TADG-15 
protein. The embodiment further includes a vector comprising the 
TADG-15 DNA and regulatory elements necessary for expression of 
the DNA in a cell. Additionally embodied is a vector in which the 
TADG-15 DNA is positioned in reverse orientation relative to the 
regulatory elements such that TADG-15 antisense mRNA is produced. 

In another embodiment of the present invention, there is 
provided an isolated and purified TADG-15 protein coded for by DNA 
selected from the following: (a) an isolated DNA which encodes a 
TADG-15 protein; (b) an isolated DNA which hybridizes under high 
stringency conditions to isolated DNA of (a) above and which encodes 
a TADG-15 protein; and (c) an isolated DNA differing from the 
isolated DNAs of (a) and (b) above in codon sequence due to the 
degeneracy of the genetic code, and which encodes a TADG-15 
protein. 

In yet another embodiment of the present invention, 
there is provided a method for detecting TADG-15 mRNA in a 
sample, comprising the steps of: (a) contacting a sample with a probe 
which is specific for TADG-15; and (b) detecting binding of the probe 
to TADG-15 mRNA in the sample. In still yet another embodiment of 

S 



the present invention, there is provided a kit for detecting TADG-15 
mRNA, comprising: an oligonucleotide probe specific for TADG-15. A 
label for detection is further embodied in the kit. 

The present invention additionally embodies a method of 
5 detecting TADG-15 protein in a sample, comprising the steps of: (a) 
contacting a sample with an antibody which is specific for TADG-15 
or a fragment thereof; and (b) detecting binding of the antibody to 
TADG-15 protein in the sample. Similarly, the present invention also 
embodies a kit for detecting TADG-15 protein, comprising: an 

10 antibody specific for TADG-15 protein or a fragment thereof. Means 
for detection of the antibody is further embodied in the kit. 

In another embodiment, the present invention provides 
an antibody specific for the TADG-15 protein or a fragment thereof. 

In yet another embodiment, the present invention 

15 provides a method of screening for compounds that inhibit TADG-15, 
comprising the steps of: (a) contacting a sample comprising TADG-15 
protein with a compound; and (b) assaying for TADG-15 protease 
activity. Typically, a decrease in the TADG-15 protease activity in 
the presence of the compound relative to TADG-15 protease activity 

20 in the absence of the compound is indicative of a compound that 
inhibits TADG-15. 



In still yet another embodiment of the present invention, 
there is provided a method of inhibiting expression of TADG-15 in a 
cell, comprising the step of: (a) introducing a vector into a cell, 
whereupon expression of the vector produces TADG-15 antisense 
mRNA in the cell which hybridizes to endogenous TADG-15 mRNA, 
thereby inhibiting expression of TADG-15 in the cell. 

Further embodied by the present invention, there is 
provided a method of inhibiting a TADG-15 protein in a cell, 
comprising the step of: (a) introducing an antibody specific for a 
TADG-15 protein or a fragment thereof into a cell, whereupon 
binding of the antibody to the TADG-15 protein inhibits the TADG-15 
protein. 

In an embodiment of the present invention, there is 
provided a method of targeted therapy to an individual, comprising 
the step of: (a) administering a compound containing a targeting 
moiety and a therapeutic moiety to an individual, wherein the 
targeting moiety is specific for TADG-15. 

In an embodiment of the present invention, there is 
provided a method of diagnosing cancer in an individual, comprising 
the steps of: (a) obtaining a biological sample from an individual; and 
(b) detecting TADG-15 in the sample, wherein the presence of TADG- 
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15 in the sample is indicative of the presence of carcinoma in the 
individual and the absence of TADG-15 in the sample is indicative of 
the absence of carcinoma in the individual. 

In another embodiment of the present invention, there is 
5 provided a method of vaccinating an individual against TADG-15, 
comprising the steps of: (a) inoculating an individual with a TADG-15 
protein or fragment thereof that lacks TADG-15 protease activity, 
wherein the inoculation with the TADG-15 protein or fragment 
thereof elicits an immune response in the individual, thereby 
10 vaccinating the individual against TADG-15. 

In an embodiment of the present invention, there is 
provided a method of producing immune-activated cells directed 
toward TADG-15, comprising the steps of: exposing dendritic cells to 
a TADG-15 protein or fragment thereof that lacks TADG-15 protease 
15 activity, wherein the exposure to said TADG-15 protein or fragment 
thereof activates the dendritic cells, thereby producing immune- 
activated cells directed toward TADG-15. 

In another embodiment of the present invention, there is 
provided an immunogenic composition, comprising an immunogenic 
20 fragment of a TADG-15 protein and an appropriate adjuvant. 



§ 



Other and further aspects, features, and advantages of 
the present invention will be apparent from the following description 
of the presently preferred embodiments of the invention given for 
the purpose of disclosure. 



BRIEF DESCRIPTION OF THE DRAWINGS 

10 So that the matter in which the above-recited features, 

advantages and objects of the invention, as well as others which will 
become clear, are attained and can be understood in detail, more 
particular descriptions of the invention briefly summarized above 
may be had by reference to certain embodiments thereof which are 

15 illustrated in the appended drawings. These drawings form a part of 
the specification. It is to be noted, however, that the appended 
drawings illustrate preferred embodiments of the invention and 
therefore are not to be considered limiting in their scope. 

Figure 1 shows a comparison of the serine protease 

20 catalytic domain of TADG-15 with Hepsin (Heps, SEQ ID No. 3), SOCE 
(SEQ ID No. 4), Trypsin (Try, SEQ ID No. 5), Chymotrypsin (Chymb, 
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SEQ ID No. 6), Factor 7 (Fac7, SEQ ID No. 7) and Tissue plasminogen 
activator (Tpa, SEQ ID No. 8). The asterisks indicate conserved amino 
acids of catalytic triad. 

Figure 2 shows the nucleotide sequence of the TADG-15 
cDNA and the derived amino acid sequence of the TADG-15 protein. 
The putative start codon is located at nucleotides 23-25. The 
potential transmembrane sequence is underlined. Possible N-linked 
glycosylation sites are indicated by a broken line. The asterisks 
indicate conserved cysteine residues of CUB domain. The SDE-motifs 
of the LDL receptor ligand binding repeat-like domain are boxed. 
The arrow shows the arginine-valine bond cleaved upon activation. 
The conserved amino acids of the catalytic triad; histidine, aspartic 
acid and serine residues are circled. 

Figure 3 shows the amino acid sequence of the TADG-15 
protease, including functional sites and domains. 

Figure 4 shows a diagram of the TADG-15 protein. 1; 
cytoplasmic domain, (aa #1-54), 2; transmembrane domain (aa #5 5- 
57), 3; extracellular domain (aa #78-213), 4-5; CUB repeat (aa #214- 
447), 6-9; LDL receptor ligand binding repeat (class A motif) like 
domain (aa #453-602), 10; serine protease (aa #615-855). 



10 



Figure 5 shows Northern blot analysis of TADG-15 
mRNA expression in normal ovary, ovarian carcinomas, carcinoma 
cell lines, normal fetal tissues and normal adult tissues. A single 
intense transcript of the TADG-15 was observed in every sub-type of 
5 carcinoma and the two ovarian carcinoma cell lines, SW626 and 
CAOV3, whereas no visible band was detected in normal ovary or the 
two breast cancer cell lines. In normal fetal tissues, fetal kidney 
showed increased transcript and faint expression was detected in 
fetal lung. In normal adult tissues, the TADG-15 was detected in 
10 colon with low expression in small intestine and prostate. 

Figure 6 A shows quantitative PCR analysis of TADG-15 
expression. Expression levels of TADG-15 relative to p-tubulin are 
significantly elevated in all LMP tumors and carcinomas compared to 
that of normal ovaries, m; mucinous, s; serous. Figure 6B shows the 
15 ratio of TADG-15 expression to expression of P-tubulin in normal 
ovary, LMP tumor and ovarian carcinoma. TADG-15 mRNA 
expression levels were significantly elevated in both LMP tumor (*; 
p<0.001) and carcinoma (**; p<0.0001) compared to that in normal 
ovary. All 10 samples of normal ovary showed a low level of TADG- 
20 15 expression. 
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Figure 7 shows the TADG-15 expression in tumor cell 
lines derived from both ovarian and breast carcinoma tissues. 

Figure 8 shows the overexpression of TADG-15 in other 
tumor tissues. 

Figure 9 shows SW626 and CAOV3 cell lysates that were 
separated by SDS-PAGE and immunoblotted. Lanes 1 and 2 were 
probed with rabbit pre-immune serum as a negative control. Lanes 
3 and 4 were probed with polyclonal rabbit antibody generated to a 
carboxy terminal peptide from TADG-15 protein sequence. 

Figure 10 shows that immunohistochemical staining of 
normal ovarian epithelium (Figure 10A) with a polyclonal antibody 
to a TADG-15 protease peptide shows no staining of the stroma or 
epithelium. However, antibody staining of carcinomas confirms the 
presence of TADG-15 expression in the cytoplasm of a serous low 
malignant potential tumor (Figure 10B); a mucinous low malignant 
potential tumor (Figure 10C); a serous carcinoma (Figure 10D); 
and its presence in both the cytoplasm and cell surface of an 
endometrioid carcinoma (Figure 10E). 

Figure 11 shows an alignment of the human TADG15 
protein sequence with that of mouse epithin which demonstrates 
that the proteins are 84% similar and 81% identical over 843 amino 

12 



acids. Residues that are identical between the two proteins are 
indicated by a while the "*" symbol represents the TADG15 
translation termination. The most significant difference between 
these two proteins lies in the carboxy-termini, which for epithin, 
includes 47 amino acids that are not present in TADG15. 

Figure 12 shows a nucleotide sequence comparison 
between TADG-15 and human SNC-19 (GeneBank Accession No. 
#U20428). 



DETAILED DESCRIPTION OF THE INVENTION 

Proteases have been implicated in the extracellular 
modulation required for tumor growth and invasion. In an effort to 
categorize those proteases contributing to ovarian carcinoma 
progression, redundant primers directed towards conserved amino 
acid domains surrounding the catalytic triad of His, Asp and Ser were 
utilized to amplify serine proteases differentially expressed in 
carcinomas. Using this method, a serine protease named TADG-15 
(tumor antigen-derived gene 15) has been identified that is 
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overexpressed in ovarian tumors. TADG-15 appears to be a 
transmembrane multidomain serine protease. TADG-15 is highly 
overexpressed in ovarian tumors based on PCR, Northern blot and 
immunolocalization. 

The TADG-15 cDNA is 3147 base pairs long (SEQ ID No. 1 ) 
encoding for a 855 amino acid protein (SEQ ID No. 2). The 
availability of the TADG-15 gene provides numerous utilities. For 
example, the TADG-15 gene can be used as a diagnostic or 
therapeutic target in ovarian and other carcinomas, including breast, 

prostate, lung and colon. 

The present invention is directed to DNA encoding a 
tumor antigen-derived gene (TADG-15) protein, selected from the 
following: (a) an isolated DNA which encodes a TADG-15 protein; (b) 
an isolated DNA which hybridizes under high stringency conditions to 
the isolated DNA of (a) above and which encodes a TADG-15 protein; 
and (c) an isolated DNA differing from the isolated DNAs of (a) and 
(b) above in codon sequence due to the degeneracy of the genetic 
code, and which encodes a TADG-15 protein. It is preferred that the 
DNA has the sequence shown in SEQ ID No. 1 and the TADG-15 
protein has the amino acid sequence shown in SEQ ID No. 2. 



The present invention is directed toward a vector 
comprising the TADG-15 DNA and regulatory elements necessary for 
expression of the DNA in a cell, or a vector in which the TADG-15 
DNA is positioned in reverse orientation relative to the regulatory 
5 elements such that TADG-15 antisense mRNA is produced. Generally, 
the DNA encodes a TADG-15 protein having the amino acid sequence 
shown in SEQ ID No. 2. The invention is also directed toward host 
cells transfected with either of the above-described vector(s). 
Representative host cells are bacterial cells, mammalian cells, plant 
10 cells and insect cells. Preferably, the bacterial cell is E. coli. 

The present invention is directed toward an isolated and 
purified TADG-15 protein coded for by DNA selected from the 
following: (a) an isolated DNA which encodes a TADG-15 protein; (b) 
an isolated DNA which hybridizes under high stringency conditions to 
15 isolated DNA of (a) above and which encodes a TADG-15 protein; and 
(c) an isolated DNA differing from the isolated DNAs of (a) and (b) 
above in codon sequence due to the degeneracy of the genetic code, 
and which encodes a TADG-15 protein. Preferably, the protein has 
the amino acid sequence shown in SEQ ID No. 2. 
20 The present invention is directed toward a method for 

detecting TADG-15 mRNA in a sample, comprising the steps of: (a) 



contacting a sample with a probe which is specific for TADG-15; and 
(b) detecting binding of the probe to TADG-15 mRNA in the sample. 
The present invention is also directed toward a method of detecting 
TADG-15 protein in a sample, comprising the steps of: (a) contacting 
a sample with an antibody which is specific for TADG-15 or a 
fragment thereof; and (b) detecting binding of the antibody to TADG- 
15 protein in the sample. Generally, the sample is a biological 
sample; preferably, the biological sample is from an individual; and 
typically, the individual is suspected of having cancer. 

The present invention is directed toward a kit for 
detecting TADG-15 mRNA, comprising: an oligonucleotide probe, 
wherein the probe is specific for TADG-15. The kit may further 
comprise: a label with which to label the probe; and means for 
detecting the label. The present invention is additionally directed 
toward a kit for detecting TADG-15 protein, comprising: an antibody 
which is specific for TADG-15 protein or a fragment thereof. The kit 
may further comprise: means to detect the antibody. 

The present invention is directed toward a antibody 
which is specific for TADG-15 protein or a fragment thereof. 

The present invention is directed toward a method of 
screening for compounds that inhibit TADG-15, comprising the steps 
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of: (a) contacting a sample containing TADG-15 protein with a 
compound; and (b) assaying for TADG-15 protease activity. 
Typically, a decrease in the TADG-15 protease activity in the 
presence of the compound relative to TADG-15 protease activity in 
the absence of the compound is indicative of a compound that 
inhibits TADG-15. 

The present invention is directed toward a method of 
inhibiting expression of TADG-15 in a cell, comprising the step of: (a) 
introducing a vector expressing TADG-15 antisense mRNA into a cell, 
which hybridizes to endogenous TADG-15 mRNA, thereby inhibiting 
expression of TADG-15 in the cell. Generally, the inhibition of TADG- 
15 expression is for treating cancer. 

The present invention is directed toward a method of 
inhibiting a TADG-15 protein in a cell, comprising the step of: (a) 
introducing an antibody specific for a TADG-15 protein or a fragment 
thereof into a cell, which inhibits the TADG-15 protein. Generally, 
the inhibition of the TADG-15 protein is for treating cancer. 

The present invention is directed toward a method of 
targeted therapy to an individual, comprising the step of: (a) 
administering a compound having a targeting moiety and a 
therapeutic moiety to an individual, wherein the targeting moiety is 
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specific for TADG-15. Representative targeting moiety are an 
antibody specific for TADG-15 and a ligand or ligand binding domain 
{e.g., CUB, LDLR, protease and extracellular) that binds TADG-15. 
Likewise, a representative therapeutic moiety is a radioisotope, a 
5 toxin, a chemotherapeutic agent and immune stimulants. Typically, 
the above-described method is useful when the individual suffers 
from ovarian cancer, breast cancer or cancers of the prostate, lung, 
colon and cervix. 

The present invention is directed toward a method of 
10 diagnosing cancer in an individual, comprising the steps of: (a) 
obtaining a biological sample from an individual; and (b) detecting 
TADG-15 in the sample. Generally, the presence of TADG-15 in the 
sample is indicative of the presence of carcinoma in the individual, 
and the absence of TADG-15 in the sample is indicative of the 
15 absence of carcinoma in the individual. Generally, the biological 
sample is blood, ascites fluid, urine, tears, saliva or interstitial fluid. 
Typical means of detecting TADG-15 are by Northern blot, Western 
blot, PCR, dot blot, ELIZA, radioimmunoassay, DNA chips or tumor cell 
labeling. This method may be useful in diagnosing cancers such as 
20 ovarian, breast and other cancers in which TADG-15 is 
overexpressed, such as lung, prostate and colon cancers. 



The present invention is also directed to an antisense 
oligonucleotide having the nucleotide sequence complementary to a 
TADG-15 mRNA sequence. The present invention is also directed to a 
composition comprising such an antisense oligonucleotide according 
and a physiologically acceptable carrier therefore. 

The present invention is also directed to a method of 
treating a neoplastic state in an individual syndrome in an individual 
in need of such treatment, comprising the step of administering to 
said individual an effective dose of an antisense oligonucleotide of. 
Preferably, the neoplastic state is selected from the group consisting 
of from ovarian cancer, breast cancer, lung cancer, prostate cancer, 
colon cancer and other cancers in which TADG-15 is overexpressed. 
For such therapy, the oligonucleotides alone or in combination with 
other anti-neoplastic agents can be formulated for a variety of modes 
of administration, including systemic, topical or localized 
administration. Techniques and formulations generally can be found 
in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, 
Pa. The oligonucleotide active ingredient is generally combined with 
a pharamceutically accceptable carrier such as a diluent or excipient 
which can include fillers, extenders, binders, wetting agents, 
disintergrants, surface active agents or lubricants, depending on the 
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nature of the mode of administration and dosage forms. Typical 
dosage forms include tablets, powders, liquid preparations including 
suspensions, emulsions, and solutions, granules, capsules and 
suppositories, as well as liquid preparations for injections, including 
5 liposome preparations. 

For systemic administration, injection is preferred, 
including intramuscular, intravenous, intraperitoneal and 
subcutaneous. For injection, the oligonucleotides of the invention are 
formulated in liquid solutions, preferably in physiologically 

10 compatible buffers. In addition, the oligonucleotides can be 
formulated in solid form and redissolved or suspended immediately 
prior to use. Lyophilized forms are also incldued. Dosages that can 
be used for systemic administration preferably range from about 
0.01 mg/kg to 50 mg/kg administered once or twice per day. 

15 However, different dosing schedules can be utilized depending on (1) 
the potency of an individual oligonucleotide at inhibiting the activity 
of its target DNA, (2) the severity or extent of the pathological 
disease state, or (3) the pharmacokinetic behavior of a given 
oligonucleotide. 

20 The present invention is directed toward a method of 

vaccinating an individual against TADG-15 overexpression, 
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comprising the steps of: (a) inoculating an individual with a TADG-15 
protein or fragment thereof which lacks TADG-15 protease activity. 
The inoculation with the TADG-15 protein or fragment thereof elicits 
an immune response in the individual, thereby vaccinating the 
5 individual against TADG-15. The vaccination with TADG-15 
described herein is intended for an individual who has cancer, is 
suspected of having cancer or is at risk of getting cancer. Generally, 
the TADG-15 fragment useful for vaccinating an individual are 9- 
residue fragments up to 20-residue fragments, with preferred 9- 
10 residue fragments shown in SEQIDNos. 2, 19, 20, 21, 29, 39, 49, 50, 
59, 79, 80, 81, 82, 83, 84, 89 and 90. 

The present invention is directed toward a method of 
producing immune-activated cells directed toward TADG-15, 
comprising the steps of: exposing dendritic cells to a TADG-15 protein 
15 or fragment thereof that lacks TADG-15 protease activity, wherein 
exposure to the TADG-15 protein or fragment thereof activates the 
dendritic cells, thereby producing immune-activated cells directed 
toward TADG-15. Representative immune-activated cells are B-cells, 
T-cells and dendrites. Generally, the TADG-15 fragment is a 9- 
20 residue fragment up to a 20-residue fragment, with preferable 9- 
residue fragments shown in SEQ ID Nos. 2, 19, 20, 21, 29, 39, 49, 50, 
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59, 79, 80, 81, 82, 83, 84, 89 and 90. Preferably, the dendritic cells 
are isolated from an individual prior to exposure, and the activated 
dendritic cells reintroduced into the individual subsequent to 
exposure. Typically, the individual for which this method may apply 
5 has cancer, is suspected of having cancer or is at risk of getting 
cancer. 

The present invention is directed toward an immunogenic 
composition, comprising an immunogenic fragment of a TADG-15 
protein and an appropriate adjuvant. Generally, the fragment is a 9 - 

10 residue fragment up to a 20-residue fragment, with preferred 9- 
residue fragments shown in SEQID Nos. 2, 19, 20, 21, 29, 39, 49, 50, 
59, 79, 80, 81, 82, 83, 84, 89 and 90. 

In accordance with the present invention there may be 
employed conventional molecular biology, microbiology, and 

15 recombinant DNA techniques within the skill of the art. Such 
techniques are explained fully in the literature. See, e.g., Maniatis, 
Fritsch & Sambrook, "Molecular Cloning: A Laboratory Manual 
(1982); "DNA Cloning: A Practical Approach," Volumes I and II (D.N. 
Glover ed. 1985); "Oligonucleotide Synthesis" (M.J. Gait ed. 1984); 

20 "Nucleic Acid Hybridization" (B.D. Hames & S.J. Higgins eds. 1985); 
"Transcription and Translation" (B.D. Hames & S.J. Higgins eds. 1984); 
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"Animal Cell Culture" (R.I. Freshney, ed. 1986); "Immobilized Cells 
And Enzymes" (IRL Press, 1986); B. Perbal, "A Practical Guide To 
Molecular Cloning" (1984). Therefore, if appearing herein, the 
following terms shall have the definitions set out below. 

As used herein, the term "cDNA" shall refer to the DNA 
copy of the mRNA transcript of a gene. 

As used herein, the term "derived amino acid sequence" 
shall mean the amino acid sequence determined by reading the 
triplet sequence of nucleotide bases in the cDNA. 

As used herein the term "screening a library" shall refer 
to the process of using a labeled probe to check whether, under the 
appropriate conditions, there is a sequence complementary to the 
probe present in a particular DNA library. In addition, "screening a 
library" could be performed by PCR. 

As used herein, the term "PCR" refers to the polymerase 
chain reaction that is the subject of U.S. Patent Nos. 4,683,195 and 
4,683,202 to Mullis, as well as other improvements now known in 
the art. 

The amino acid described herein are preferred to be in 
the "L" isomeric form. However, residues in the "D" isomeric form 
can be substituted for any L-amino acid residue, as long as the 



desired functional property of immunoglobulin-binding is retained 
by the polypeptide. NH 2 refers to the free amino group present at 
the amino terminus of a polypeptide. COOH refers to the free 
carboxy group present at the carboxy terminus of a polypeptide. In 
5 keeping with standard polypeptide nomenclature, / Biol. Chem., 
243:3552-59 (1969), abbreviations for amino acid residues are used 
as in customary in the art. 

It should be noted that all amino-acid residue sequences 
are represented herein by formulae whose left and right orientation 
10 is in the conventional direction of amino-terminus to carboxy- 
terminus. Furthermore, it should be noted that a dash at the 
beginning or end of an amino acid residue sequence indicates a 
peptide bond to a further sequence of one or more amino-acid 
residues. 

15 A "replicon" is any genetic element (e.g., plasmid, 

chromosome, virus) that functions as an autonomous unit of DNA 
replication in vivo; i.e., capable of replication under its own control. 

A "vector" is a replicon, such as plasmid, phage or cosmid, 
to which another DNA segment may be attached so as to bring about 

20 the replication of the attached segment. A "vector" may further be 
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defined as a replicable nucleic acid construct, e.g., a plasmid or viral 
nucleic acid. 

A "DNA molecule" refers to the polymeric form of 
deoxyribonucleotides (adenine, guanine, thymine, or cytosine) in its 
5 either single- stranded form or as a double- stranded helix. This term 
refers only to the primary and secondary structure of the molecule, 
and does not limit it to any particular tertiary forms. Thus, this term 
includes double- stranded DNA found, inter alia, in linear DNA 
molecules {e.g., restriction fragments), viruses, plasmids, and 
10 chromosomes. The structure is discussed herein according to the 
normal convention of giving only the sequence in the 5' to 3' 
direction along the nontranscribed strand of DNA {i.e., the strand 
having a sequence homologous to the mRNA). 

An expression vector is a replicable construct in which a 
15 nucleic acid sequence encoding a polypeptide is operably linked to 
suitable control sequences capable of effecting expression of the 
polypeptide in a cell. The need for such control sequences will vary 
depending upon the cell selected and the transformation method 
chosen. Generally, control sequences include a transcriptional 
20 promoter and/or enhancer, suitable mRNA ribosomal binding sites, 
and sequences which control the termination of transcription and 
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translation. Methods which are well known to those skilled in the art 
can be used to construct expression vectors containing appropriate 
transcriptional and translational control signals. See, for example, 
techniques described in Sambrook et al., 1989, Molecular Cloning: A 
5 Laboratory Manual (2nd Ed.), Cold Spring Harbor Press, N.Y. A gene 
and its transcription control sequences are defined as being 
"operably linked" if the transcription control sequences effectively 
control transcription of the gene. Vectors of the invention include, 
but are not limited to, plasmid vectors and viral vectors. Preferred 

10 viral vectors of the invention, are those derived from retroviruses, 
adenovirus, adeno-associated virus, SV40 virus, or herpes viruses. 
In general, expression vectors contain promoter sequences which 
facilitate the efficient transcription of the inserted DNA fragment and 
are used in connection with the host. The expression vector typically 

15 contains an origin of replication, promoter(s), terminator(s), as well 
as specific genes which are capable of providing phenotypic selection 
in transformed cells. The transformed hosts can be fermented and 
cultured according to means known in the art to achieve optimal cell 
growth. 

20 An "origin of replication" refers to those DNA sequences 

that participate in DNA synthesis. 



A DNA "coding sequence" is a double-stranded DNA 
sequence which is transcribed and translated into a polypeptide in 
vivo when placed under the control of appropriate regulatory 
sequences. The boundaries of the coding sequence are determined 
5 by a start codon at the 5' (amino) terminus and a translation stop 
codon at the 3' (carboxyl) terminus. A coding sequence can include, 
but is not limited to, prokaryotic sequences, cDNA from eukaryotic 
mRNA, genomic DNA sequences from eukaryotic {e.g., mammalian) 
DNA, and even synthetic DNA sequences. A polyadenylation signal 
10 and transcription termination sequence will usually be located 3' to 
the coding sequence. 

Transcriptional and translational control sequences are 
DNA regulatory sequences, such as promoters, enhancers, 
polyadenylation signals, terminators, and the like, that provide for 
15 the expression of a coding sequence in a host cell. 

A "promoter sequence" is a DNA regulatory region 
capable of binding RNA polymerase in a cell and initiating 
transcription of a downstream (3* direction) coding sequence. For 
purposes of defining the present invention, the promoter sequence is 
20 bounded at its 3' terminus by the transcription initiation site and 
extends upstream (5' direction) to include the minimum number of 
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bases or elements necessary to initiate transcription at levels 
detectable above background. Within the promoter sequence will be 
found a transcription initiation site, as well as protein binding 
domains (consensus sequences) responsible for the binding of RNA 
5 polymerase. Eukaryotic promoters often, but not always, contain 
"TATA" boxes and "CAT" boxes. Prokaryotic promoters typically 
contain Shine-Dalgarno ribosome-binding sequences in addition to 
the -10 and -35 consensus sequences. 

An "expression control sequence" is a DNA sequence that 

10 controls and regulates the transcription and translation of another 
DNA sequence. A coding sequence is "under the control" of 
transcriptional and translational control sequences in a cell when 
RNA polymerase transcribes the coding sequence into mRNA, which 
is then translated into the protein encoded by the coding sequence. 

15 A "signal sequence" can be included near the coding 

sequence. This sequence encodes a signal peptide, N-terminal to the 
polypeptide, that communicates to the host cell to direct the 
polypeptide to the cell surface or secrete the polypeptide into the 
media, and this signal peptide is clipped off by the host cell before 

20 the protein leaves the cell. Signal sequences can be found associated 
with a variety of proteins native to prokaryotes and eukaryotes. 



As used herein, the terms "restriction endonucleases" and 
"restriction enzymes" refer to enzymes, each of which cut double- 
stranded DNA at or near a specific nucleotide sequence. 

A cell has been "transformed" by exogenous or 
5 heterologous DNA when such DNA has been introduced inside the 
cell. The transforming DNA may or may not be integrated 
(covalently linked) into the genome of the cell. In prokaryotes, 
yeast, and mammalian cells for example, the transforming DNA may 
be maintained on an episomal element such as a plasmid. With 
10 respect to eukaryotic cells, a stably transformed cell is one in which 
the transforming DNA has become integrated into a chromosome so 
that it is inherited by daughter cells through chromosome replication. 
This stability is demonstrated by the ability of the eukaryotic cell to 
establish cell lines or clones comprised of a population of daughter 
15 cells containing the transforming DNA. A "clone" is a population of 
cells derived from a single cell or ancestor by mitosis. A "cell line" is 
a clone of a primary cell that is capable of stable growth in vitro for 
many generations. 

Two DNA sequences are "substantially homologous" when 
20 at least about 75% (preferably at least about 80%, and most 
preferably at least about 90% or 95%) of the nucleotides match over 
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the defined length of the DNA sequences. Sequences that are 
substantially homologous can be identified by comparing the 
sequences using standard software available in sequence data banks, 
or in a Southern hybridization experiment under, for example, 
5 stringent conditions as defined for that particular system. Defining 
appropriate hybridization conditions is within the skill of the art. 
See, e.g., Maniatis et al., supra; DNA Cloning, Vols. I & II, supra; 
Nucleic Acid Hybridization, supra. 

A "heterologous" region of the DNA construct is an 

10 identifiable segment of DNA within a larger DNA molecule that is not 
found in association with the larger molecule in nature. Thus, when 
the heterologous region encodes a mammalian gene, the gene will 
usually be flanked by DNA that does not flank the mammalian 
genomic DNA in the genome of the source organism. In another 

15 example, coding sequence is a construct where the coding sequence 
itself is not found in nature (e.g., a cDNA where the genomic coding 
sequence contains introns, or synthetic sequences having codons 
different than the native gene). Allelic variations or naturally- 
occurring mutational events do not give rise to a heterologous region 

20 of DNA as defined herein. 



The labels most commonly employed for these studies 
are radioactive elements, enzymes, chemicals which fluoresce when 
exposed to ultraviolet light, and others. A number of fluorescent 
materials are known and can be utilized as labels. These include, for 
5 example, fluorescein, rhodamine, auramine, Texas Red, AMCA blue 
and Lucifer Yellow. A particular detecting material is anti-rabbit 
antibody prepared in goats and conjugated with fluorescein through 
an isothiocyanate. Proteins can also be labeled with a radioactive 
element or with an enzyme. The radioactive label can be detected by 

10 any of the currently available counting procedures. The preferred 
isotope may be selected from m, 32 P, 35 S, 36Q, 51Q-, 57 Co , 58Co, 
59p e , 90y, 125L 1311, and i 86 Re. Enzyme labels are likewise useful, and 
can be detected by any of the presently utilized colorimetric, 
spectrophotometric, fluorospectrophotometric, amperometric or 

15 gasometric techniques. The enzyme is conjugated to the selected 
particle by reaction with bridging molecules such as carbodiimides, 
diisocyanates, glutaraldehyde and the like. Many enzymes which can 
be used in these procedures are known and can be utilized. The 
preferred are peroxidase, ^-glucuronidase, p-D-glucosidase, p-D- 

20 galactosidase, urease, glucose oxidase plus peroxidase and alkaline 
phosphatase. U.S. Patent Nos. 3,654,090, 3,850,752, and 4,016,043 



are referred to by way of example for their disclosure of alternate 
labeling material and methods. 

A particular assay system developed and utilized in the 
art is known as a receptor assay. In a receptor assay, the material to 
5 be assayed is appropriately labeled and then certain cellular test 
colonies are inoculated with a quantitiy of both the label after which 
binding studies are conducted to determine the extent to which the 
labeled material binds to the cell receptors. In this way, differences 
in affinity between materials can be ascertained. 

10 An assay useful in the art is known as a "cis/trans" assay. 

Briefly, this assay employs two genetic constructs, one of which is 
typically a plasmid that continually expresses a particular receptor of 
interest when transfected into an appropriate cell line, and the 
second of which is a plasmid that expresses a reporter such as 

15 luciferase, under the control of a receptor/ligand complex. Thus, for 
example, if it is desired to evaluate a compound as a ligand for a 
particular receptor, one of the plasmids would be a construct that 
results in expression of the receptor in the chosen cell line, while the 
second plasmid would possess a promoter linked to the luciferase 

20 gene in which the response element to the particular receptor is 
inserted. If the compound under test is an agonist for the receptor, 



the ligand will complex with the receptor, and the resulting complex 
will bind the response element and initiate transcription of the 
luciferase gene. The resulting chemiluminescence is then measured 
photometrically, and dose response curves are obtained and 
5 compared to those of known ligands. The foregoing protocol is 
described in detail in U.S. Patent No. 4,981,784. 

As used herein, the term "host" is meant to include not 
only prokaryotes but also eukaryotes such as yeast, plant and animal 
cells. A recombinant DNA molecule or gene which encodes a human 

10 TADG-15 protein of the present invention can be used to transform a 
host using any of the techniques commonly known to those of 
ordinary skill in the art. Especially preferred is the use of a vector 
containing coding sequences for the gene which encodes a human 
TADG-15 protein of the present invention for purposes of prokaryote 

15 transformation. Prokaryotic hosts may include E. coli, S. 
tymphimurium, Serratia marcescens and Bacillus subtilis. Eukaryotic 
hosts include yeasts such as Pichia pastoris, mammalian cells and 
insect cells. 

The invention includes a substantially pure DNA encoding 
20 a TADG-15 protein, a DNA strand which will hybridize at high 
stringency to a probe containing a sequence of at least 1 5 



consecutive nucleotides of (SEQ ID No. 1). The protein encoded b y 
the DNA of this invention may share at least 80% sequence identity 
(preferably 85%, more preferably 90%, and most preferably 95%) 
with the amino acids listed in Figures 3 and 4 (SEQ ID No. 2). More 
5 preferably, the DNA includes the coding sequence of the nucleotides 
of Figure 2 (SEQ ID No. 1), or a degenerate variant of such a sequence. 
This invention also includes a substantially pure DNA containing a 
sequence of at least 15 consecutive nucleotides (preferably 20, more 
preferably 30, even more preferably 50, and most preferably all) of 

10 the region from nucleotides 1 to 3147 of the nucleotides shown in 
Figure 2 (SEQ ID No. 1). 

By "substantially pure DNA" is meant DNA that is not part 
of a milieu in which the DNA naturally occurs, by virtue of separation 
(partial or total purification) of some or all of the molecules of that 

15 milieu, or by virtue of alteration of sequences that flank the claimed 
DNA. The term therefore includes, for example, a recombinant DNA 
which is incorporated into a vector, into an autonomously replicating 
plasmid or virus, or into the genomic DNA of a prokaryote or 
eukaryote; or which exists as a separate molecule (e.g., a cDNA or a 

20 genomic or cDNA fragment produced by polymerase chain reaction 
(PCR) or restriction endonuclease digestion) independent of other 



sequences. It also includes a recombinant DNA which is part of a 
hybrid gene encoding additional polypeptide sequence, e.g., a fusion 
protein. Also included is a recombinant DNA which includes a 
portion of the nucleotides listed in Figure 2 (SEQ ID No. 1) and which 
5 encodes an alternative splice variant of TADG-15. 

By a "substantially pure protein" is meant a protein 
which has been separated from at least some of those components 
which naturally accompany it. Typically, the protein is substantially 
pure when it is at least 60% (by weight) free from the proteins and 

10 other naturally-occurring organic molecules with which it is 
naturally associated in vivo. Preferably, the purity of the 
preparation (by weight) is at least 75%, more preferably at least 90%, 
and most preferably at least 99%. A substantially pure TADG-15 
protein may be obtained, for example, by extraction from a natural 

15 source; by expression of a recombinant nucleic acid encoding a TADG- 
15 polypeptide; or by chemically synthesizing the protein. Purity 
can be measured by any appropriate method, e.g., column 
chromatography, such as immunoaffinity chromatography using an 
antibody specific for TADG-15, polyacryl amide gel electrophoresis, or 

20 HPLC analysis. A protein is substantially free of naturally associated 
components when it is separated from at least some of those 
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contaminants which accompany it in its natural state. Thus, a protein 
which is chemically synthesized or produced in a cellular system 
different from the cell from which it naturally originates will be, by 
definition, substantially free from its naturally associated 
5 components. Accordingly, substantially pure proteins include 
eukaryotic proteins synthesized in E. coli, other prokaryotes, or any 
other organism in which they do not naturally occur. 

The term "oligonucleotide", as used herein, is defined as a 
molecule comprised of two or more ribonucleotides, preferably more 
c 10 than three. Its exact size will depend upon many factors, which, in 
j=± turn, depend upon the ultimate function and use of the 

H oligonucleotide. The term "primer", as used herein, refers to an 

L oligonucleotide, whether occurring naturally (as in a purified 

m restriction digest) or produced synthetically, and which is capable of 

*S 15 initiating synthesis of a strand complementary to a nucleic acid when 
placed under appropriate conditions, i.e., in the presence of 
nucleotides and an inducing agent, such as a DNA polymerase, and at 
a suitable temperature and pH. The primer may be either single- 
stranded or double-stranded and must be sufficiently long to prime 
20 the synthesis of the desired extension product in the presence of the 
inducing agent. The exact length of the primer will depend upon 
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many factors, including temperature, sequence and/or homology of 
primer and the method used. For example, in diagnostic applications, 
the oligonucleotide primer typically contains 15-25 or more 
nucleotides, depending upon the complexity of the target sequence, 
5 although it may contain fewer nucleotides. 

The primers herein are selected to be "substantially" 
complementary to particular target DNA sequences. This means that 
the primers must be sufficiently complementary to hybridize with 
their respective strands. Therefore, the primer sequence need not 
10 reflect the exact sequence of the template. For example, a non- 
complementary nucleotide fragment (i.e., containing a restriction 
site) may be attached to the 5' end of the primer, with the remainder 
of the primer sequence being complementary to the strand. 
Alternatively, non-complementary bases or longer sequences can be 
15 interspersed into the primer, provided that the primer sequence has 
sufficient complementary with the sequence to hybridize therewith 
and form the template for synthesis of the extension product. 

The probe to which the DNA of the invention hybridizes 
preferably consists of a sequence of at least 20 consecutive 
20 nucleotides, more preferably 40 nucleotides, even more preferably 
50 nucleotides, and most preferably 100 nucleotides or more (up to 
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100%) of the coding sequence of the nucleotides listed in Figure 2 
(SEQ ID No. 1) or the complement thereof. Such a probe is useful for 
detecting expression of TADG-15 in a cell by a method including the 
steps of (a) contacting mRNA obtained from the cell with a labeled 
5 TADG-15 hybridization probe; and (b) detecting hybridization of the 
probe with the mRNA. 

By "high stringency" is meant DNA hybridization and 
wash conditions characterized by high temperature and low salt 
concentration, e.g., wash conditions of 65°C at a salt concentration of 

10 approximately 0.1X SSC, or the functional equivalent thereof. For 
example, high stringency conditions may include hybridization at 
about 42°C in the presence of about 50% formamide; a first wash at 
about 65°C with about 2X SSC containing 1% SDS; followed by a 
second wash at about 65°C with about 0.1X SSC. 

15 The DNA may have at least about 70% sequence identity 

to the coding sequence of the nucleotides listed in Figure 2 (SEQ ID 
No. 1), preferably at least 75% (e.g., at least 80%); and most 
preferably at least 90%. The identity between two sequences is a 
direct function of the number of matching or identical positions. 

20 When a position in both of the two sequences is occupied by the 
same monomeric subunit, e.g., if a given position is occupied by an 
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adenine in each of two DNA molecules, then they are identical at that 
position. For example, if 7 positions in a sequence 10 nucleotides in 
length are identical to the corresponding positions in a second 10- 
nucleotide sequence, then the two sequences have 70% sequence 
5 identity. The length of comparison sequences will generally be at 
least 50 nucleotides, preferably at least 60 nucleotides, more 
preferably at least 75 nucleotides, and most preferably 
100 nucleotides. Sequence identity is typically measured using 
sequence analysis software (e.g., Sequence Analysis Software 
10 Package of the Genetics Computer Group (GCG), University of 
Wisconsin Biotechnology Center, 1710 University Avenue, Madison, 
WI 53705). 

The present invention comprises a vector comprising a 
DNA sequence which encodes a human TADG-15 protein, wherein 

15 said vector is capable of replication in a host, and comprises, in 
operable linkage: a) an origin of replication; b) a promoter; and c) a 
DNA sequence coding for said TADG-15 protein. Preferably, the 
vector of the present invention contains a portion of the DNA 
sequence shown in SEQ ID No. 1. Vectors may be used to amplify 

20 and/or express nucleic acid encoding a TADG-15 protein or fragment 
thereof. 



In addition to substantially full-length proteins, the 
invention also includes fragments (e.g., antigenic fragments) of the 
TADG-15 protein (SEQ ID No. 2). As used herein, "fragment," as 
applied to a polypeptide, will ordinarily be at least 6 residues, more 
5 typically at least 9-12 residues, and preferably at least 13- 
20 residues in length, but less than the entire, intact sequence. 
Alternatively, a fragment may be an individual domain of 20-120 
residues from SEQ ID No. 2. Fragments of the TADG-15 protein can 
£S& be generated by methods known to those skilled in the art, e.g., by 

jg 10 enzymatic digestion of naturally occurring or recombinant TADG-15 
M 1 protein, by recombinant DNA techniques using an expression vector 

{71 that encodes a defined fragment of TADG-15, or by chemical 

M= synthesis. The ability of a candidate fragment to exhibit a 

^ characteristic of TADG-15 (e.g., binding to an antibody specific for 

*~ 15 TADG-15) can be assessed by methods described herein. Purified 
TADG-15 or antigenic fragments of TADG-15 can be used to generate 
new antibodies or to test existing antibodies (e.g., as positive controls 
in a diagnostic assay) by employing standard protocols known to 
those skilled in the art. Included in this invention is polyclonal 
20 antisera generated by using TADG-15 or a fragment of TADG-15 as 
the immunogen in, e.g., rabbits. Standard protocols for monoclonal 
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and polyclonal antibody production known to those skilled in this art 
are employed. The monoclonal antibodies generated by this 
procedure can be screened for the ability to identify recombinant 
TADG-15 cDNA clones, and to distinguish them from other cDNA 
5 clones. 

Further included in this invention are TADG-15 proteins 
which are encoded, at least in part, by portions of SEQ ID No. 2, e.g., 
products of alternative mRNA splicing or alternative protein 
processing events, or in which a section of TADG-15 sequence has 

10 been deleted. The fragment, or the intact TADG-15 polypeptide, may 
be covalently linked to another polypeptide, e.g., one which acts as a 
label, a ligand or a means to increase antigenicity. 

The invention also includes a polyclonal or monoclonal 
antibody which specifically binds to TADG-15. The invention 

15 encompasses not only an intact monoclonal antibody, but also an 
immunologically-active antibody fragment, e.g., a Fab or (Fab) 2 
fragment; an engineered single chain Fv molecule; or a chimeric 
molecule, e.g., an antibody which contains the binding specificity of 
one antibody, e.g., of murine origin, and the remaining portions of 

20 another antibody, e.g., of human origin. 



In one embodiment, the antibody, or a fragment thereof, 
may be linked to a toxin or to a detectable label, e.g., a radioactive 
label, non-radioactive isotopic label, fluorescent label, 
chemiluminescent label, paramagnetic label, enzyme label, or 
5 colorimetric label. Examples of suitable toxins include diphtheria 
toxin, Pseudomonas exotoxin A, ricin, and cholera toxin. Examples of 
suitable enzyme labels include malate hydrogenase, staphylococcal 
nuclease, delta-5-steroid isomerase, alcohol dehydrogenase, alpha- 
glycerol phosphate dehydrogenase, triose phosphate isomerase, 
10 peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, 
beta-galactosidase, ribonuclease, urease, catalase, glucoses- 
phosphate dehydrogenase, glucoamylase, acetylcholinesterase, etc. 
Examples of suitable radioisotopic labels include 3 H, 125 I, 131 I, 32 P, 35 S, 
14 C, etc. 

15 Paramagnetic isotopes for purposes of in vivo diagnosis 

can also be used according to the methods of this invention. There 
are numerous examples of elements that are useful in magnetic 
resonance imaging. For discussions on in vivo nuclear magnetic 
resonance imaging, see, for example, Schaefer et al., (1989) JACC 14, 

20 472-480; Shreve et al., (1986) Magn. Reson. Med. 3, 336-340; Wolf, G 
L., (1984) Physiol. Chem. Phys. Med. NMR 16, 93-95; Wesbey et al., 



(1984) Physiol. Chem. Phys. Med. NMR 16, 145-155; Runge et al., 
(1984) Invest. Radiol. 19, 408-415. Examples of suitable fluorescent 
labels include a fluorescein label, an isothiocyalate label, a 
rhodamine label, a phycoerythrin label, a phycocyanin label, an 
5 allophycocyanin label, an ophthaldehyde label, a fluorescamine label, 
etc. Examples of chemiluminescent labels include a luminal label, an 
isoluminal label, an aromatic acridinium ester label, an imidazole 
label, an acridinium salt label, an oxalate ester label, a luciferin label, 
a lucif erase label, an aequorin label, etc. 

10 Those of ordinary skill in the art will know of other 

suitable labels which may be employed in accordance with the 
present invention. The binding of these labels to antibodies or 
fragments thereof can be accomplished using standard techniques 
commonly known and used by those of ordinary skill in the art. 

15 Typical techniques are described by Kennedy et al., (1976) Clin. 
Chim. Acta 70, 1-31; and Schurs et al., (1977) Clin. Chim. Acta 81, 1- 
40. Coupling techniques mentioned in the latter are the 
glutaraldehyde method, the periodate method, the dimaleimide 
method, the m-maleimidobenzyl-N-hydroxy-succinimide ester 

20 method. All of these methods are incorporated by reference herein. 
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Also within the invention is a method of detecting TADG- 
15 protein in a biological sample, which includes the steps of 
contacting the sample with the labeled antibody, e.g., radioactively 
tagged antibody specific for TADG-15, and determining whether the 
antibody binds to a component of the sample. Antibodies to the 
TADG-15 protein can be used in an immunoassay to detect increased 
levels of TADG-15 protein expression in tissues suspected of 
neoplastic transformation. These same uses can be achieved with 
Northern blot assays and analyses. 

As described herein, the invention provides a number of 
diagnostic advantages and uses. For example, the TADG-15 protein is 
useful in diagnosing cancer in different tissues since this protein is 
highly overexpressed in tumor cells. Antibodies (or antigen-binding 
fragments thereof) which bind to an epitope specific for TADG-15, 
are useful in a method of detecting TADG-15 protein in a biological 
sample for diagnosis of cancerous or neoplastic transformation. This 
method includes the steps of obtaining a biological sample {e.g., cells, 
blood, plasma, tissue, etc.) from a patient suspected of having cancer, 
contacting the sample with a labeled antibody {e.g., radioactively 
tagged antibody) specific for TADG-15, and detecting the TADG-15 
protein using standard immunoassay techniques such as an ELISA. 
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Antibody binding to the biological sample indicates that the sample 
contains a component which specifically binds to an epitope within 
TADG-15. 

Likewise, a standard Northern blot assay can be used to 
5 ascertain the relative amounts of TADG-15 mRNA in a cell or tissue 
obtained from a patient suspected of having cancer, in accordance 
with conventional Northern hybridization techniques known to those 
of ordinary skill in the art. This Northern assay uses a hybridization 
probe, e.g., radiolabeled TADG-15 cDNA, either containing the full- 
10 length, single stranded DNA having a sequence complementary to 
SEQID No. 1 (Figure 2), or a fragment of that DNA sequence at least 
20 (preferably at least 30, more preferably at least 50, and most 
preferably at least 100 consecutive nucleotides in length). The DNA 
hybridization probe can be labeled by any of the many different 
15 methods known to those skilled in this art. 

The following examples are given for the purpose of 
illustrating various embodiments of the invention and are not meant 
to limit the present invention in any fashion. 
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EXAMPLE 1 



Tissue collection and storage 

Upon patient hysterectomy, bilateral salpingo- 
oophorectomy, or surgical removal of neoplastic tissue, the specimen 
is retrieved and placed on ice. The specimen was then taken to the 
resident pathologist for isolation and identification of specific tissue 
samples. Finally, the sample was frozen in liquid nitrogen, logged 
into the laboratory record and stored at -80°C. 

Additional specimens were frequently obtained from the 
Cooperative Human Tissue Network (CHTN). These samples were 
prepared by the CHTN and shipped on dry ice. Upon arrival, these 
specimens (e.g., blood (serum), urine, saliva, tears and insterstitial 
fluid) were logged into the laboratory record and stored at -80°C. 
Participation of the following divisions of the Cooperative Human 
Tissue Network (CHTN) in providing tumor tissues is acknowledged: 
Western Division, Case Western Reserve University, (Cleveland, OH); 
Midwestern Division, Ohio state University, (Columbus, OH); Eastern 
Division, NDRI, (Philadelphia, PA); Pediatric Division, Children's 
Hospital, (Columbus, OH); Southern Division, University of Alabama at 
Birmingham, (Birmingham, AL). 



EXAMPLE 2 



mRNA isolation and cDNA synthesis 

Forty-one ovarian tumors (10 low malignant potential 
5 tumors and 31 carcinomas) and 10 normal ovaries were obtained 
from surgical specimens and frozen in liquid nitrogen. The human 
ovarian carcinoma cell lines SW626 and CAOV3, and the human 
breast carcinoma cell lines MDA-MB-231 and MDA-MB-435S, were 
purchased from the American Type Culture Collection (Rockville, 
10 MD). Cells were cultured to sub-confluency in Dulbecco's modified 
Eagle's medium supplemented with 10% (v/v) fetal bovine serum 

and antibiotics. 

Messenger RNA (mRNA) isolation was performed 

according to the manufacturer's instructions using the Mini 
15 RiboSep™ Ultra mRNA Isolation Kit purchased from Becton 

Dickinson. In this procedure, polyA + mRNA was isolated directly 

from the tissue lysate using the affinity chromatography media 

oligo(dT) cellulose. The amount of mRNA recovered was quantitated 

by UV spectrophotometry. 
20 First-strand complementary DNA (cDNA) was synthesized 

using 5.0 \ig of mRNA and either random hexamer or oligo(dT) 



primers according to the manufacturer's protocol utilizing a first 
strand synthesis kit obtained from CLONIECH (Palo Alto, CA). The 
purity of the cDNA was evaluated by PCR using primers specific for 
the p53 gene. These primers span an intron such that pure cDNA can 
be distinguished from cDNA that is contaminated with genomic DNA. 



EXAMPLE 3 

PCR with redundant primers, cloning of TADG-15 cDNA, T- vector 
li gation and transformations and DNA sequencing 
Redundant primers, 
forward 5'-TGGGTIGTIACIGCIGCICA(C/T)TG-3' (SEQ ID No. 11) 
and reverse 5'-A(A/G)IGGICCICCI(C/G)(T/A)(A/G)TCICC-3' (SEQ ID 
No. 12), corresponding to the amino acids surrounding the catalytic 
triad for serine proteases, were used to compare the PCR products 
from normal and carcinoma cDNAs. 

The purified PCR products were ligated into the Promega 
T-vector plasmid and the ligation products used to transform JM109 
competent cells according to the manufacturer's instructions 
(Promega). Positive colonies were cultured for amplification, the 
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plasmid DNA isolated using the Wizard™ Minipreps DNA purification 
system (Promega), and the plasmids were digested with Apal and 
Sad restriction enzymes to determine the size of the insert. 
Plasmids with inserts of the size(s) visualized by the previously 
5 described PCR product gel electrophoresis were sequenced. 

Individual colonies were cultured and plasmid DNA was 
isolated using the Wizard Miniprep DNA purification system 
(Promega). Applied Biosystems Model 373A DNA sequencing system 
was used for direct cDNA sequence determination. Utilizing a 

10 plasmid-specific primer near the cloning site, sequencing reactions 
were carried out using PRISM™ Ready Reaction Dye Deoxy™ 
terminators (Applied Biosystems) according to the manufacturer's 
instructions. Residual dye terminators were removed from the 
completed sequencing reaction using a Centri-sep™ spin column 

15 (Princeton Separation). Based upon the determined sequence, 
primers that specifically amplify the gene of interest were designed 
and synthesized. 

The original TADG-15 subclone (436bp) was randomly 
labeled and used as a probe to screen an ovarian tumor cDNA library 

20 by standard hybridization techniques. 13 The library was constructed 
in 8ZAP using mRNA isolated from the tumor cells of a stage 



Ill/grade III ovarian adenocarcinoma patient. Three overlapping 
clones were obtained which spanned 3147 nucleotides. 

5 EXAMPLE 4 

Northern blot analysis 

10 \ig mRNAs were size separated by electrophoresis 

through a 1% formaldehyde-agarose gel in 0.02 M MOPS, 0.05 M 
10 sodium acetate (pH 7.0), and 0.001 M EDTA. The mRNAs were then 
blotted to Hybond-N + nylon membrane (Amersham) by capillary 
action in 20x SSPE. The RNAs are fixed to the membrane by baking 
for 2 hours at 80°C. 32 P-labeled cDNA probes were made by Prime- 
a-Gene Labeling System (Promega). The PCR products amplified by 
15 the same primers described above were used for probes. The blots 
were prehybridized for 30 min and hybridized for 60 min at 68 °C 
with 32 P-labeled cDNA probe in ExpressHyb Hybridization Solution 
(CLONTECH). Control hybridization to determine relative gel loading 
was performed with a (3 -tubulin probe. 
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Normal human tissues; spleen, thymus, prostate, testis, 
ovary, small intestine, colon and peripheral blood leukocyte, and 
normal human fetal tissues; brain, lung, liver and kidney (Human 
Multiple Tissue Northern Blot; CLONTECH) were also examined by the 
same hybridization procedure. Additional multiple tissue northern 
(MTN) blots from CLONTECH include the Human MTN blot, the Human 
MTN II blot, the Human Fetal MTN II blot, and the Human Brain MTN 
III blot. 



EXAMPLE 5 

Western blot analysis 

Polyclonal rabbit antibody was generated by 
immunization with a poly-lysine linked multiple Ag peptide derived 
from the TADG-15 protein sequence 'LFRDWIKENTGV' (SEQ ID No. 
13). Approximately 20 jig of cell lysates were separated on a 15% 
SDS-PAGE gel and electroblotted to PVDF at 100 V for 40 min at 4°C. 
The proteins were fixed to the membrane by incubation in 50% 
MeOH for 10 min. The membrane was blocked overnight in TBS (pH 

7.8) containing 0.2% non-fat milk. Primary antibody was added to 
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the membrane at a dilution of 1:100 in 0.2% milk/TBS and incubated 
for 2 h at room temperature. The blot was washed and incubated 
with a 1:3000 dilution of alkaline-phosphatase conjugated goat anti- 
rabbit IgG (BioRad) for 1 h at room temperature. The blot was 
washed and incubated with a chemiluminescent substrate before a 
10 sec exposure to X-ray film for visualization. 



EXAMPLE 6 

Quantitative PCR 

The mRNA overexpression of TADG-15 was determined 
using a quantitative PCR. Quantitative PCR was performed. 1112 
Oligonucleotide primers were used 
for TADG-15: 

forward 5 ' - ATG AC AG AGG ATTC AGGT AC-3 ' (SEQ ID No. 14) and 
reverse 5 ' -GA AGGTG AAGTC ATTGA AGA-3 ' (SEQ ID No. 15); and 

and for p -tubulin: 

forward 5 ' -CGC ATCA ACGTGT ACT AC AA-3 ' (SEQ ID No. 16) and 
reverse 5 ' -T ACG AGCTGGTGG ACTGAG A-3 ' (SEQ ID No. 17). 

p-tubulin was utilized as an internal control. 



The PCR reaction mixture consists of cDNA derived from 
50 ng of mRNA, 5 pmol of sense and antisense primers for both the 
TADG-15 gene and the p-tubulin gene, 200 |imol of dNTPs, 5 uCi of cc- 
32 PdCTP and 0.25 units of Taq DNA polymerase with reaction buffer 
(Promega) in a final volume of 25 pi. The target sequences were 
amplified in parallel with the p -tubulin gene. Thirty cycles of PCR 
were carried out in a Thermal Cycler (Perkin Elmer Gene Amp 2400; 
Perkin-Elmer Cetus). Each cycle of PCR included 30 sec of 
denaturation at 94°C, 30 sec of annealing at 60°C and 30 sec of 
extension at 72°C. The annealing temperature varies according to the 
primers that are used in the PCR reaction. For the reactions 
involving degenerate primers, an annealing temperature of 48°C was 
used. The appropriate annealing temperature for the TADG-15- and 
p -tubulin- specific primers is 62°C. 

A portion of the PCR products were separated on 2% 
agarose gels and the radioactivity of each PCR product was 
determined by using a Phospholmager (Molecular Dynamics). In the 
present study, the expression ratio (TADG-15/p-tubulin) was used to 
evaluate gene expression and defined the value at mean ± 2SD of 
normal ovary as the cut-off value to determine overexpression. The 
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student's t test was used for comparison of the mean values of 
normal ovary and tumors. 

5 EXAMPLE 7 

Tmmunobistochemistrv 

Immunohistochemical staining was performed using a 
Vectastain Elite ABC Kit (Vector). Formalin-fixed and paraffin- 

10 embedded specimens were routinely deparaffinized and processed 
using microwave heat treatment in 0.01 M sodium citrate buffer (pH 
6.0). The specimens were incubated with normal goat serum in a 
moist chamber for 30 min. After incubation with biotinylated anti- 
rabbit IgG for 30 min, the sections were then incubated with ABC 

15 reagent (Vector) for 30 min. The final products were visualized 
using the AEC substrate system (DAKO) and sections were 
counter stained with hematoxylin before mounting. Negative controls 
were performed using normal serum instead of the primary 
antibody. 

20 
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EXAMPLE 8 



Antisense TADG-15 

TADG-15 is cloned and expressed in the opposite 
5 orientation such that an antisense RNA molecule (SEQ ID No. 18) is 
produced. For example, the antisense RNA is used to hybridize to the 
complementary RNA in the cell and thereby inhibit translation of 
TADG-15 RNA into protein. 



10 EXAMPLE 9 

Peptide ranking 

For vaccine or immune stimulation, individual 9-mers to 
11-mers were examined to rank the binding of individual peptides to 

15 the top 8 haplotypes in the general population (Parker et al., (1994)). 
The computer program used for this analyses can be found at 
<http://www-bimas.dcrt.nih.gov/molbio/hla_bind/>. Table 1 shows 
the peptide ranking based upon the predicted half-life of each 
peptide's binding to a particular HLA allele. A larger half-life 

20 indicates a stronger association with that peptide and the particular 
HLA molecule. The TADG-15 peptides that strongly bind to an HLA 



allele are putative immunogens, and are used to innoculate an 
individual against TADG-15. 



TADG-15 peptide ranking 



TABLE 1 





HLA Type 
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10 
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20 


15 
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20 
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EXAMPLE 10 



TADG-15 cDNA 

30 A screening strategy to identify proteases which are 

overexpressed in human cancer has been developed in which RT-PCR 
products amplified specifically in tumors, as compared to normal 

Si 



tissue, are examined. 9 During this effort, candidate genes were 
identified using redundant sense primers to the conserved amino 
acid histidine domain at the NH 3 end of the catalytic domain and 
antisense primers to the downstream conserved amino acid serine 
5 domain. Subcloning and sequencing the appropriate 480 base pair 
band(s) amplified in such a PCR reaction provides the basis for 
identifying the gene(s) encoding proteases(s). Among these 
amplified catalytic domains, a new serine protease gene named 
TADG-15 (tumor antigen-derived gene 15) was identified. The 

10 catalytic domain of the newly identified TADG-15 protein is similar 
to other serine proteases and specifically contains conserved amino 
acids appropriate for the catalytic domain of the trypsin-like serine 
protease family. 

A computerized search of GenEMBL databases using the 

15 FASTA program (Wisconsin Package Version 9.1, GCG, Madison, 
Wisconsin) for amino acid sequences homologous to the TADG-15 
protease domain revealed that homologies with other known human 
proteases never exceeds 55%. Figure 1 shows the alignment of the 
protease domain of TADG-15 compared with other human serine 

20 proteases. Using the BESTFIT program available through GCG, the 
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similarities between TADG-15 and trypsin, chymotrypsin, and tissue- 
type plasminogen activator are 51%, 46% and 52%, respectively. 

From the sequence derived from the TADG-15 catalytic 
domain, specific primers were synthesized to amplify a TADG-15- 
5 specific probe for library screening. After screening an ovarian 
carcinoma library, one 1785 bp clone was obtained which included 
the 3' end of the TADG-15 transcript. Upon further screening using 
the 5' end of the newly detected clone, two additional clones were 
identified which provided another 1362 bp of the cDNA, including 

10 the 5' end of the TADG-15 transcript. The total length of the 
sequenced cDNA was approximately 3.15 kb. The total nucleotide 
sequence obtained includes a Kozak's consensus sequence preceding 
a single open reading frame encoding a predicted protein of 8 55 
amino acids (Figure 2). 

15 The deduced open reading frame encoded by the TADG- 

15 nucleotide sequence (Figures 2, 3 and 4) contains several distinct 
domains as follows: an amino terminal cytoplasmic tail (amino acids 
(aa) #1-54), a potential transmembrane domain (aa #55-77), an 
extracellular membrane domain (aa #78-213), two complement 

20 subcomponents Clr/Cls, Uegf, and bone morphogenetic protein 1 
(CUB) repeats (aa #214-447), four ligand binding repeats of the low 



density lipoprotein (LDL) receptor-like domain (aa #453-602) and a 
serine protease domain (aa #615-855). The TADG-15 protein also 
contains two potential N-linked glycosylation sites (aa #109 and 302) 
and a potential proteolytic cleavage site upstream from the protease 
5 domain (aa #614) which could release and/or activate the protease 
at the carboxy end of this protein. In addition, TADG-15 contains an 
RGD motif (aa #249-251) which is commonly found in proteins 
involved in cell-cell adhesion. 

1 10 

[J EXAMPLE 11 

TADG-15 expression 
m To examine the size of the transcript for TADG-15 and its 

C 15 pattern of expression in various tissues, Northern blot hybridization 
was performed for representative histological types of carcinoma and 
in a series of cell lines, fetal tissues and normal adult tissues (Figure 
5). The transcript size for the TADG-15 message was determined to 
be approximately 3.2 kb and a single intense transcript appeared to 
20 be present in all of the carcinomas examined, whereas no visible 
band was detected in normal ovary (Figure 5). This transcript size is 
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also in good agreement with the sequence data predicting a 
transcript size of 3.15 kb. The ovarian tumor cell lines, SW626 and 
CAOV3, also showed an abundance of transcript, however little or no 
transcript was detectable in the breast carcinoma cell lines MDA-MB- 
5 231 and MDA-MB-4355. Among normal human fetal tissues, fetal 
kidney showed an abundance of the TADG-15 transcript and low 
expression was also detected in fetal lung. In normal adult tissues, 
TADG-15 was detected in colon with low levels of expression in small 
intestine and prostate (Figure 5). 

10 To evaluate mRNA transcript expression of TADG-15 in 

ovarian tumors and normal ovary, semi-quantitative PCR (Figure 6) 
was performed. In a preliminary study, the linearity of this 
assay 11 ' 12 was confirmed and its efficacy correlated with both 
Northern blots and immunohistochemistry. The data was quantified 

15 using a phosphoimager and compared as a ratio of expression (TADG- 
15/p -tubulin). Results herein indicate that TADG-15 transcript 
expression is elevated above the cut-off value (mean for normal 
ovary ± 2 SD) in all of the tumor cases examined and is either not 

detected or detected at extremely low levels in normal ovaries 
20 (Figure 6A and B). Analysis of ovarian carcinoma subtypes, including 
early stage and late stage disease, confirms overexpression of TADG- 



15 in all carcinomas examined (Table 2). All of the carcinomas 
studied, which included 5 stage I and 3 stage II carcinomas, showed 
overexpression of the TADG-15 gene. 

These data can also be examined with regard to tumor 
5 stage and histological sub-type, and results indicated that every 
carcinoma of every stage and histological sub-type overexpressed 
the TADG-15 gene. The expression ratio (mean value ± SD) for 

normal ovary group was determined as 0.182 ± 0.024, for LMP 

tumor group as 0.847 ± 0.419 and for carcinoma group as 0.771 ± 

10 0.380 (Table 2). A comparison between the normal ovary group and 
tumor groups showed that overexpression of the TADG-15 gene is 
statistically significant in both the LMP tumor group and the 
carcinoma group (LMP tumor: p<0.001, carcinoma: p<0.0001). 

As shown in Figure 6, TADG-15 transcripts were noted in 

15 all ovarian carcinomas, but were not present at detectable levels in 
any of the following tissues: a) normal ovary, b) fetal liver and brain, 
c) adult spleen, thymus, testes, ovary and peripheral blood 
lymphocytes, d) skeletal muscle, liver, brain or heart. This 
evaluation was extended to a standard panel of about 40 tumors. 

20 Using TADG-15-specific primers, the expression was also examined in 



tumor cell lines derived from both ovarian and breast carcinoma 
tissues as shown in Figure 7 and in other tumor tissues as shown in 
Figure 8. Expression of TADG-15 was also observed in carcinomas of 
the breast, colon, prostate and lung. 
5 Polyclonal antibodies developed to a synthetic peptide (a 

12-mer) at the carboxy terminus of the protease domain were used 
to examine TADG-15 expression in cell lines by Western blot and by 
immunolocalization in normal ovary and ovarian tumors. Western 
blots of cell extracts from SW626 and CAOV3 cells were probed with 

10 both antibody and preimmune sera (Figure 9). Several bands were 
detected with the antibody, including bands of approximately 
100,000 daltons, approximately 60,000 daltons and 32,000 daltons. 
The anticipated molecular size of the complete TADG-15 molecule is 
estimated to be approximately 100,000 daltons, and the protease 

15 domain which may be released by proteolytic cleavage at aa #614 is 
estimated to be approximately 32,000 daltons. Some intermediate 
proteolytic product may be represented by the 60,000 dalton band. 

Antibody staining of tumor cells confirms the presence of 
the TADG-15 protease in the cytoplasm of a serous LMP tumor, 

20 mucinous LMP tumor and serous carcinoma (Figure 10B, C & D, 
respectively). This diffuse staining pattern may be due to detection 



of TADG15 within the cell as it is being packaged and transported to 
the cell surface. In endometrioid carcinoma, the antigen is clearly 
detectable on the surface of tumor cells (Figure 10E). No staining 
was detected in normal ovarian epithelium or stromal cells (Figure 
5 10A). Immunohistochemical staining of a series of 27 tumors 
indicates the presence of the TADG-15 protein in all the carcinoma 
subtypes examined, including the low malignant potential group. 
Strong staining was noted in 7 of 9 low malignant potential tumors 
and 13 of 18 carcinomas (Table 3). 
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TABLE 2 



Number of cases with overexpression of TADG-15 in normal ovaries 



and ovarian tumors 



N overexpression 
of TADG-15 



expression ratio a 



Normal 



10 



0 (0%) 



0.182 ± 0.024 



LMP 



10 



10 (100%) 



0.847 ± 0.419 



10 



serous 



6 (100%) 



0.862 ± 0.419 



mucinous 



4 (100%) 



0.825 + 0.483 



Carcinoma 



3 1 



31 (100%) 



0.771 ± 0.380 



serous 



1 8 



18 (100%) 



0.779 ± 0.332 



mucinous 



7 (100%) 



0.907 + 0.584 



15 



endometrioid 3 



3 (100%) 



0.502 ± 0.083 



clear cell 



3 (100%) 



0.672 ± 0.077 



The ratio of expression level of TADG-15 to P-tubulin (mean ± SD) 



Si 



TABLE 3 



Immunohistochemical staining using TADG-15 



10 



15 



20 



25 



30 



Lab No. 


Histology 


1 ADCr-lD 




Surface epithelium of the ovary 




H-3 194 


serous (LMP) 


44 


H-162 


serous (LMP) 


44 


H-l 182 


serous (LMP) 


44 


H-4818 


serous (LMP) 


44 


TT A O O 1 

H-488 l 


/t a i"n\ 

serous (LMP) 


44 


TT y ^1 c 

H-675 


mucinous (LMP) 


+ 


TT A A y" 

H-2446 


mucinous (LMP) 


+ 


T T f \ ^ f\ ^ 

H-0707 


* /T H !t TA \ 

mucinous (LMP) 


44 


H-2042 


mucinous (LMP) 


44 


H-2555 


serous carcinoma 


44 


H-1858 


serous carcinoma 


44 


H-5266 


serous carcinoma 


44 


H-5316 


serous carcinoma 


+ 


H-2597 


serous carcinoma 


+ 


H-493 1 


mucinous carcinoma 


44 


H-l 867 


mucinous carcinoma 


H+ 


H-5998 


mucinous carcinoma 


44 


H-2679 


endometrioid adenocarcinoma 


+ 


H-5718 


endometrioid adenocarcinoma 


44 


H-3993 


endometrioid adenocarcinoma 


+ 


H-2991 


endometrioid adenocarcinoma 


44 


H-2489 


endometrioid adenocarcinoma 


44 


H-5994 


clear cell carcinoma 


44 


H-6718 


clear cell carcinoma 


44 


H-1661 


clear cell carcinoma 


44 


H-6201 


clear cell carcinoma 


44 


H-5640 


clear cell carcinoma 


+ 
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- Negative; + 
cell staining) 



Weak Positive; 4+ Strong Positive (more than 50% of 



«7 



EXAMPLE 12 



TADG-15 homology 

Recently, a mouse protein named epithin (GenBank 
5 Accession No. AF042822) has been described. 14 Epithin is a 902 
amino acid protein which contains a similar structure to TADG-15 in 
that it has a cytoplasmic domain, transmembrane domain, two CUB 
domains, four LDLR-like domains and a carboxy terminal serine 
protease domain. TADG-15 and epithin are 84% similar over 843 

10 amino acids, suggesting that the proteins may be orthologous (Figure 
11). The precise role of epithin remains to be elucidated. 

A search of GeneBank for similar previously identified 
sequences yielded one such sequence with relatively high homology 
to a portion of the TADG-15 gene. The similarity between the 

15 portion of TADG-15 from nucleotide #182 to 3139 and SNC-19 
GeneBank Accession No. #U20428) is approximately 97% (Figure 12). 
There are however significant differences between SNC-19 and 
TADG-15. For example, TADG-15 has an open reading frame of 85 5 
amino acids whereas the longest open reading frame of SNC-19 is 

20 173 amino acids. Additionally, SNC-19 does not include a proper 
start site for the initiation of translation, nor does it include the 



amino terminal portion of the protein encoded by TADG-15. 
Moreover, SNC-19 does not include an open reading frame for a 
functional serine protease because the His, Asp and Ser residues of 
the catalytic triad that are necessary for function are encoded in 
5 different reading frames. 

Implications 

The overall structure of the TADG-15 protein is relatively 
similar to the members of the tolloid/BMP-1 family and the 

10 complement subcomponents, Clr/Cls. These proteins contain both 
CUB and protease domains, and complex formation through the 
ligand binding domain is essential for their function. Activation of 
the serine protease domains of Clr and Cls requires proteolytic 
cleavage of Arg-Gly and Arg-Ile bonds, respectively. 15 Similarly, it 

15 might be expected that the TADG-15 protein is synthesized as a 
zymogen, which is activated by cleavage between Arg 614 and Val 615 
and analogous to the activation mechanism of other serine protease 
zymogens. Western blot analysis of cultured cell lysates confirmed 
both a 100 kDa and 32 kDa peptide, which correspond to the putative 

20 zymogen (whole molecule) and a cleaved protease product of TADG- 
15 (Figure 9). These data support a model for proteolytic release 



and/or activation of TADG-15 as occurs for similar type II serine 
proteases. 

CUB domains were first found in complement 
subcomponents Clr/Cls 16 " 18 and are known to be a widespread 
5 module in developmentally regulated proteins, such as the bone 
morphogenetic protein-1 (BMP-1) and the tolloid gene product. 18 " 20 
The role of these repeats remains largely unknown. However, some 
models suggest that the CUB domain may be involved in protein- 
protein interactions. The CUB domain of Clr and Cls participates in 

10 the assembly of the Cls-Clr-Clr-Cls tetrameric complex in the 
activation of the classical pathway of complement by providing 
protein-protein interaction domains. 15 The Drosophila 

decapentaplegic (DPP) protein is essential for dorsal-ventral 
specification of the embryo, and the Drosophila tolloid (TLD) forms a 

15 complex with DPP to regulate its activity. 19 ' 20 Missense mutations in 
the CUB domain of the tolloid protein results in a phenotype that 
does not allow a protein interaction with the DPP complex. 19 

The TADG-15 protein contains two tandem repeats of 
CUB-like domains between amino acid residues 214 and 447. Each of 

20 these is approximately 110 amino acids long and each has four 
conserved cysteine residues characteristic of other CUBs (amino acids 



214, 244, 268, 294, 340, 366, 397, 410). By analogy, the CUB repeats 
of the TADG-15 protein may form an interactive domain capable of 
promoting multimeric complex formation and regulating the activity 
of the target protein or TADG-15 itself. 
5 The TADG-15 protein also contains the LDL receptor 

ligand binding repeat (class A motif) -like domain, which consists of 
four contiguous cysteine-rich repeats (amino acid residues 453 to 
602). Each cysteine-rich repeat is approximately 40 amino acids long 
and contains a conserved, negatively-charged sequence (Ser-Asp- 

P 

« 10 Glu) with six cysteine residues. In the LDL receptor protein, this 
ji repeat is thought to function as a protein-binding domain which 

^ interacts with the lysine and arginine residues present in 

lipoproteins. 2122 In addition, the first repeat of the LDL receptor 
ry appears to bind Ca 2+ and not the lipoproteins. 23 By analogy, it is 

■5 15 possible that the LDL receptor-like repeat in TADG-15 may act in a 
similar fashion, interacting with positively charged regions of other 
proteins and/or as a Ca 2+ binding site. As a result of ligand binding 
and the formation of receptor-ligand complex, LDL receptor is 
internalized via clathrin-coated pits. 24 These types of plasma 
20 membrane receptors contain a characteristic amino acid sequence in 
their cytoplasmic domain for binding to clathrin-coated pits. 24 TADG- 
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15 does not contain this motif in its cytosolic region, and 
furthermore, no similarities with other known protein sequences 
were found in the cytoplasmic domain of the TADG-15. This finding 
suggests that TADG-15 functions in a different manner from the 
5 endocytic receptors (such as the LDL receptor), although TADG-15 
possesses similar ligand-binding repeats in the extracellular matrix. 

Although the precise role of TADG-15 is unknown, this 
gene is clearly overexpressed in ovarian tumors. A variety of 
proteases, such as type IV collagenase and plasminogen activator, 

10 appear to be involved in the process of tumor invasion and are 
constituents of a protease cascade in malignant progression. TADG- 
15 may constitute such an activity and directly digest extracellular 
matrix components surrounding a tumor, or activate other proteases 
by cleavage of inactive precursors, indirectly enhancing tumor 

15 growth and invasion. It is also possible that TADG-15 may function 
like a member of the tolloid/BMP-1 family by forming complexes 
with other growth factors or signal transduction proteins to modulate 
their activities. 

These data raise the possibility that the TADG-15 gene 

20 and its translated protein will be a useful marker for the early 
detection of ovarian carcinoma through release of the protease 



domain into the extracellular matrix and ultimately the circulation. 
These data also suggest the possibility of using TADG-15 as a target 
for therapeutic intervention through delivery systems directed at the 
CUB/LDLR ligand binding domains. 
5 The following references were cited herein: 

1. Liotta, L.A., et al. Cell, 64: 327-336, 1991. 

2. Duffy, M.J. Clin. Exp. Metastasis, 10: 145-155, 1992. 

3. Tryggvason, K., et al. Biochem. Biophys. Acta., 907: 191-217, 
1987. 

10 4. Levy, A.T., et al. Cancer Res., 51: 439-444, 1991. 

5. Monsky, W.L. et al. Cancer Res., 53: 3159-3164, 1993. 

6. Duffy, M.J. et al. Cancer, 62: 531-533, 1988. 

7. Hackel, C, et al. Cancer, 79: 53-58, 1997. 

8. Watt, K, et al. Proc. Natl. Acad. Sci. U.S.A., 83: 3166-3170, 1986. 
15 9. Tanimoto, H. et al. Cancer Res., 57: 2884-2887, 1997. 

11. Shigemasa, K. et al. J. Soc. Gynecol. Invest., 4: 95-102, 1997. 

12. Tanimoto, H. et al. Gynecol. Oncol., 66: 308-312, 1997. 

13. Maniatis, T., Fritsch, E.F. & Sambrook, J. Molecular Cloning, p. 
309-361 Cold Spring Harbor Laboratory, New York, 1982. 

20 14. Kim, M.G., et al. Immunogenetics, 49(5): 420-428, 1999. 
15. Arlaud et al. Method in Enzymology, 223: 61-82, 1993. 



16. Journet, A. & Tosi, M. Biochem. J., 240: 783-787, 1986. 

17. Mackinnon, CM., et al. Eur. J. Biochem., 169: 547-553, 1987. 

18. Bork, P. & Beckmann, G. J. Mol. Biol., 231: 539-545, 1993. 

19. Childs, S.R. & O'Connor, M.B. Dev. Biol., 162: 209-220, 1994. 
5 20. Blader, PL, et al. Science, 278: 1937-1940, 1997. 

21. Yamamoto, T. et al. Cell, 39: 27-38, 1984. 

22. Daly, N.L., et al. Proc. Natl. Acad. Sci., 92: 6334-6338, 1995. 

23. van Driel, I.R., et al. J. Biol. Chem., 262: 17443-17449, 1987. 

24. Lodish, H. et al. Sorting of membrane proteins internalized from 
10 the cell surface. In: Molecular Cell Biology, 3rd ed., p. 7 22-7 3 3 

Scientific American Books, Inc., New York, 1995. 

25. Parker, KC et al. J. Immunol. 152:163, 1994. 

Any patents or publications mentioned in this 
15 specification are indicative of the levels of those skilled in the art to 
which the invention pertains. These patents and publications are 
herein incorporated by reference to the same extent as if each 
individual publication was specifically and individually indicated to 
be incorporated by reference. 
20 One skilled in the art will readily appreciate that the 

present invention is well adapted to carry out the objects and obtain 



the ends and advantages mentioned, as well as those inherent 
therein. The present examples along with the methods, procedures, 
treatments, molecules, and specific compounds described herein are 
presently representative of preferred embodiments, are exemplary, 
5 and are not intended as limitations on the scope of the invention. 
Changes therein and other uses will occur to those skilled in the art 
which are encompassed within the spirit of the invention as defined 
by the scope of the claims. 



WHAT IS CLAIMED IS: 



1. DNA encoding a tumor antigen-derived gene 
(TADG-15) protein, selected from the group consisting of: 

5 (a) isolated DNA which encodes a TADG-15 protein; 

(b) isolated DNA which hybridizes under high 
stringency conditions to the isolated DNA of (a) above and which 
encodes a TADG-15 protein; and 

(c) isolated DNA differing from the isolated DNAs of (a) 
10 and (b) above in codon sequence due to the degeneracy of the 

genetic code, and which encodes a TADG-15 protein. 

2. The DNA of claim 1, wherein said DNA has the 
15 sequence shown in SEQ ID No. 1. 

3. The DNA of claim 1, wherein said TADG-15 protein 
has the amino acid sequence shown in SEQ ID No. 2. 

20 
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4. A vector comprising the DNA of claim 1 and 
regulatory elements necessary for expression of said DNA in a cell. 

5 5. The vector of claim 4, wherein said DNA encodes a 

TADG-15 protein having the amino acid sequence shown in SEQ ID 
No. 2. 

10 6. The vector of claim 4, wherein said DNA is 

positioned in reverse orientation relative to said regulatory elements 
such that TADG-15 antisense mRNA is produced. 

15 7. A host cell transfected with the vector of claim 4, 

said vector expressing a TADG-15 protein. 

8. The host cell of claim 7, wherein said cell is selected 
20 from the group consisting of bacterial cells, mammalian cells, plant 
cells and insect cells. 

11 



9. The host cell of claim 8, wherein said bacterial cell 

is E. coli. 



10. Isolated and purified TADG-15 protein coded for by 
5 DNA selected from the group consisting of: 

(a) isolated DNA which encodes a TADG-15 protein; 

(b) isolated DNA which hybridizes under high 
stringency conditions to isolated DNA of (a) above and which encodes 
a TADG-15 protein; and 

10 (c) isolated DNA differing from the isolated DNAs of (a) 

and (b) above in codon sequence due to the degeneracy of the 
genetic code, and which encodes a TADG-15 protein. 

11. The TADG-15 protein of claim 10, wherein said 
15 protein has the amino acid sequence shown in SEQ ID No. 2. 

12. A method for detecting TADG-15 mRNA in a 
sample, comprising the steps of: 

(a) contacting a sample with a probe, wherein said 
20 probe is specific for TADG-15; and 

i a 



(b) detecting binding of said probe to TADG-15 mRNA 
in said sample. 

13. The method of claim 12, wherein said sample is a 
5 biological sample. 

14. The method of claim 13, wherein said biological 
sample is from an individual. 

10 

15. The method of claim 14, wherein said individual is 
suspected of having cancer. 

15 

16. A kit for detecting TADG-15 mRNA, comprising: 

an oligonucleotide probe, wherein said probe is specific 
for TADG-15. 

17. The kit of claim 16, further comprising: 
20 a label with which to label said probe; and 

means for detecting said label. 
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18. A method of detecting TADG-15 protein in a 
sample, comprising the steps of: 

(a) contacting a sample with an antibody, wherein said 
antibody is specific for TADG-15 or a fragment thereof; and 
5 (b) detecting binding of said antibody to TADG-15 

protein in said sample. 



19. The method of claim 18, wherein said sample is a 
10 biological sample. 

20. The method of claim 19, wherein said biological 
sample is from an individual. 

15 21. The method of claim 20, wherein said individual is 

suspected of having cancer. 



22. A kit for detecting TADG-15 protein, comprising: 
20 an antibody, wherein said antibody is specific for TADG- 

15 protein or a fragment thereof. 
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23. The kit of claim 22, further comprising: 
means to detect said antibody. 

5 24. An antibody, wherein said antibody is specific for 

TADG-15 protein or a fragment thereof. 

25. A method of screening for compounds that inhibit 
10 TADG-15, comprising the steps of: 

(a) contacting a sample with a compound, wherein said 
sample comprises TADG-15 protein; and 

(b) assaying for TADG-15 protease activity, wherein a 
decrease in said TADG-15 protease activity in the presence of said 

15 compound relative to TADG-15 protease activity in the absence of 
said compound is indicative of a compound that inhibits TADG-15. 

26. A method of inhibiting expression of TADG-15 in a 
20 cell, comprising the step of introducing the vector of claim 6 into a 

cell, wherein expression of said vector produces TADG-15 antisense 

il 



mRNA in said cell, wherein said TADG-15 antisense mRNA hybridizes 
to endogenous TADG-15 mRNA, thereby inhibiting expression of 
TADG-15 in said cell. 

5 27. A method of inhibiting a TADG-15 protein in a cell, 

comprising the step of introducing an antibody into a cell, wherein 
said antibody is specific for a TADG-15 protein or a fragment thereof, 
wherein binding of said antibody to said TADG-15 protein inhibits 
said TADG-15 protein. 

10 

28. A method of targeted therapy to an individual, 
comprising the step of: 

(a) administering a compound to an individual, 
wherein said compound has a targeting moiety and a therapeutic 
15 moiety, wherein said targeting moiety is specific for TADG-15. 

29. The method of claim 28, wherein said targeting 
moiety is selected from the group consisting of an antibody specific 

20 for TADG-15 and a ligand or ligand binding domain that binds TADG- 
15. 
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30. The method of claim 28, wherein said therapeutic 
moiety is selected from the group consisting of a radioisotope, a 
toxin, a chemotherapeutic agent, an immune stimulant and a 
cytotoxic agent. 



31. The method of claim 28, wherein said individual 
suffers from ovarian cancer, lung cancer, prostate cancer, colon 
cancer and other cancers in which TADG-15 is overexpressed. 

10 



32. A method of diagnosing cancer in an individual, 
comprising the steps of: 

(a) obtaining a biological sample from an individual; 
15 (b) detecting TADG-15 in said sample, 

wherein the presence of TADG-15 in said sample is 
indicative of the presence of carcinoma in said individual, wherein 
the absence of TADG-15 in said sample is indicative of the absence of 
carcinoma in said individual. 
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33. The method of claim 32, wherein said biological 
sample is selected from the group consisting of blood, urine, saliva, 
tears, interstitial fluid, ascites fluid, tumor tissue biopsy and 
circulating tumor cells. 

5 

34. The method of claim 32, wherein said detection of 
said TADG-15 is by means selected from the group consisting of 
Northern blot, Western blot, PCR, dot blot, ELIZA sandwich assay, 

10 radioimmunoassay, DNA array chips and flow cytometry. 

35. The method of claim 32, wherein said carcinoma is 
selected from the group consisting of ovarian, breast, lung, colon, 

15 prostate and others in which TADG-15 is overexpressed. 

36. A method of vaccinating an individual against 
TADG-15, comprising the steps of: 

20 inoculating an individual with a TADG-15 protein or 

fragment thereof, wherein said TADG-15 protein or fragment thereof 
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lacks TADG-15 protease activity, wherein said inoculation with said 
TADG-15 protein or fragment thereof elicits an immune response in 
said individual, thereby vaccinating said individual against TADG-15. 

5 

37. The method of claim 36, wherein said individual 
has cancer, is suspected of having cancer or is at risk of getting 
cancer. 

10 

38. The method of claim 36, wherein said TADG-15 
fragment is selected from the group consisting of a 9-residue 
fragment up to a 20-residue fragment. 

15 

39. The method of claim 38, wherein said 9-residue 
fragment is selected from the group consisting of SEQ ID Nos. 2, 19, 
20, 21, 29, 39, 49, 50, 59, 79, 80, 81, 82, 83, 84, 89 and 90. 

20 
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40. A method of producing immune-activated cells 
directed toward TADG-15, comprising the steps of: 

exposing dendritic cells to a TADG-15 protein or fragment 
thereof, wherein said TADG-15 protein or fragment thereof lacks 
5 TADG-15 protease activity, wherein said exposure to said TADG-15 
protein or fragment thereof activates said dendritic cells, thereby 
producing immune-activated cells directed toward TADG-15. 

10 41. The method of claim 40, wherein said immune- 

activated cells are selected from the group consisting of B-cells, T- 
cells and dendrites. 

15 42. The method of claim 40, wherein said TADG-15 

fragment is selected from the group consisting of a 9-residue 
fragment up to a 20-residue fragment. 
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43. The method of claim 42, wherein said 9-residue 
fragment is selected from the group consisting of SEQ ID Nos. 2, 19, 
20, 21, 29, 39, 49, 50, 59, 79, 80, 81, 82, 83, 84, 89 and 90. 



5 

44. The method of claim 40, wherein said dendritic 
cells are isolated from an individual prior to said exposure, wherein 
said activated dendritic cells are reintroduced intc said individual 
subsequent to said exposure. 

10 

45. The method of claim 44, wherein said individual 
has cancer, is suspected of having cancer or is at risk of getting 
cancer. 

15 

46. An immunogenic composition, comprising an 
immunogenic fragment of a TADG-15 protein and an appropriate 
adjuvant. 

20 
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47. The immunogenic composition of claim 46, wherein 
said fragment is selected from the group consisting of a 9-residue 
fragment up to a 20-residue fragment. 

5 

48. The immunogenic composition of claim 47, wherein 
said 9-residue fragment is selected from the group consisting of SEQ 
ID Nos. 2, 19, 20, 21, 29, 39, 49, 50, 59, 79, 80, 81, 82, 83, 84, 89 and 
90. 

10 

49. An oligonucleotide having the nucleotide sequence 
complementary to a sequence of claim 1. 

15 

50. A composition comprising the oligonucleotide 
according to claim 49 and a physiologically acceptable carrier 
therefore. 

20 
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51. A method of treating a neoplastic state in an 
individual syndrome in an individual in need of such treatment, 
comprising the step of administering to said individual an effective 
dose of the oligonucleotide of claim 49. 

5 

52. The method of claim 51, wherein said neoplastic 
state is selected from the group consisting of ovarian cancer, breast 
cancer, lung cancer, colon cancer, prostate cancer and other cancers 

10 in which TADG-15 is overexpressed. 



ABSTRACT OF THE DISCLOSURE 



The present invention provides DNA encoding a TADG-15 
protein as well as a TADG-15 protein. Also provided is a vector 
5 capable of expressing the DNA of the present invention adapted for 
expression in a recombinant cell and regulatory elements necessary 
for expression of the DNA in the cell. The present invention further 
provides for methods of inhibiting TADG-15 expression and/or 
protease activity, methods of detecting TADG-15 mRNA and/or 

10 protein and methods of screening for TADG-15 inhibitors. 
Additionally, the present invention provides for cell-specific 
targeting via TADG-15 and methods of vaccinating an individual 
against TADG-15. The methods described are useful in the diagnosis, 
treatment and prevention of cancer, particularly breast and ovarian 

15 cancer. 
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Heps RIVGGRD7SL 

TadglS RVVGG7DADE 

Scce KIIDGAPCAR 

Try KIVGGYNCEE 

Ch yzib R I VN GE DA V ? 

Fac7 RIVGGXVCPX 

Tea RIKGGLFADI 



GRWFWQVSL RYDG.A KLCGGSLLSG 

GEWPWQVSL HALGQG HICGASLISP 

GSKPWQVAL LSGNQL H . CGGVLVNE 

NSVPYQVSL NSGYHF . . CGGSLINE 

GSWPWQVSL QDKTGF HFCGGS1ISE 

GECPWQVLL LVNG.A QLCGGT1INT 

ASKPWQAAIF AKHRRSPGER FLCGGILISS 



DWVLTAAHCF PE RNRV LSRWRVFAGA VAQASFHGLQ 

KWLVSAAHCY IDDRGFRYSD P7QW7AFLGL HDQSQRSAPG 

RWVLTAAKC K MNEYTVHLGS DTLG..DR.R 

QWVVSAGHC Y KSRIQVRLGE HMIEVLEG.N 

DWVVTAAKC GV RTSDVVVAGE FDQGSDEE.N 

IKVVSAAKCF DKIKNWRNLI AVLGE HDLSEnDGDE 

CWILSAAHCF QERFPPHHL TVILGR . TYRVVrGEE 



Heps LGVQAWYHG GYLPFRDPNS EENSNDIALV HLSS.PLPLT EYIQPVCLPA . . . AGQALVD GKIC7VTGWG NTQYYGQQ. A 

TadglS VQERRLKRII SHFFFNDFTF D...YDIALL ELEK.FAEYS SKVRPICLPD . ..ASHVFPA GKAIKVTGWG HTQYGGTG. A 

Scce AQRIKASKSF RHPGYSTQT . . .HVNDLMLV KLNS . QARLS SMVKKVRLPS . . . RCE . . P? G7TC7VSGV1G TTTSPDV7F? 

Try EQFIKAAKII RKPQYDRKT. . . LNNDIMLI KhSS . RAVIN ARVSTISLP7 . ..APP..A7 GTKCLISGWG NT AS SG AD VP 

Chv^JD IQVLKIAKVF KMPKFSILT . . . VNNDITLL KLAT.PARFS QTVSAVCLPS . ..ADDDFPA G7LCA77GWG KTKYNANKTP 

Fac7 QSRRVAQVII ? STYV? GTTNHDIALL RLHQ.rVVLT D.HVVPLCLPE RTFSERTLAF VRFSLVSGV-G QLLDRGATAL 

Tpa EQXFEVEKYI VHKEFDDDTY D . . , NDIALL QLKSDSSRCA CESSWRTVC LPPADLQLPD WTECELSGYG KKEALSPFYS 



Heps GVLQEARV?: ISNDVCNGAD FYGN..QIKP KM FCA.GY PEG G IDA CQGDSGGPFV CEDSISRTPR WRLCGIVSWG 

TadgIS LILQKGSIRV INQTTCE . . N LLPQ..QIT? RMMCVGFLSG G VDS CQGDSGGPL. . .SSVEADGR IFQAGVVSWG 

Scce SDLKCVDVXL 1S?QDCTKV . .YKD..LLEN SMLCAGIPDS K KKA CNGDSGGPLV C R GTLQGLVSWG 

Try DELQCLDAPV LSQAKCEAS . . YPG. . KITS NMFCVGFLEG G KDS CQGDSGG?VV C N GQLQGVVSWG 

Chyntb DKLQQAALPL LSUhZCKKS. .WGR..RI7D VMICAG. .AS G VSS CMGDSGGPLV C QKDGA W7LVGIVSWG 

Fac7 ELMVLNVPRL MTQDCLQQSR XVGDSPNI7E YMFCAGYSDG S KDS CKGDSGGP. . . . HATHYRGT WYLTGIVSWG 

Tpa ERLKEAKVRL YtSSRCTSQH LLNRT..VTD NMLCASD7RS GGPQANLHDA CQGDSGGPLV CLN DGR M7LVGIISWG 



Heps 7.GCALAQK? GVY7KVSDFR EWIFQAIKTH SEASGXVIQL — 

TadglS D. GCAQRNK? GVYTRLPLFR DWIKENTGV- — ~ 

Scce 7FPCGQPND? GVYTQVCKF7 KWINDTMKKH R~~--~-- ~ 

Try -D.GCAQKNK? GVYTKVYNYV KWIKN7IAAN S — — ~ 

Chyi?£> SDTCS.TSS? GVYARVTKLI FWVQKILAAN — — 

Fac7 Q.GCA7VGHF GVYTRVSQYI EWLQKLMRSE PRPGVLLRAP F? 

Tpa .LGCGQXDV? GVYTKVTNYL DWIRDNMRP- ~ — 



1 TCAAGAGCGGCCTCGGGGTACCATGGGGAGCGATCGGGC^ 

MGSDRARKG GGGPKDFGAGLKYKSRBEKVNGLE 
121 GGAAGGCGTGGAGTTCCTGCCAGTCAACAACGTCAAGAAGGTG^ 

EGVEFLPVNHVKKVEKHGPGR WVVE.AAVLIGLLLVLLGIG 
241 CTTCCTGGTGTGGCAITTGCAGTACCGGGJtfOTGCGTGTCC^ 

XVFNGYMRITHENFVDAYENS H J * ■ y 
rrACAGCGGAGTCCCATICCI GGGC CCCTACCACAAGGAGTCGGC^ 

YSGVPFI»GPYHKESAVTAFSEGSVIA 
CGTCATGGCCGAGGAGCGCGXAGTCAXGCTGCCCCCt 



361 

481 

€01 

721 

841 

961 

1081 

1201 

1321 

1441 

1561 

1681 

1801 

1921 

2041 

2161 

2281 

2401 

2521 

2641 
2761 
2881 
3001 
3121 



VSLASKVKDALKL 
CTACTACTGGTCTGAGTTCAGCAT(XCGCAGCACCTG 
YYWSEFSIPQHLV 



EEAERVMAE 



ERVVMLPPRARSLKSPVV 

!CCCCGCTGTGGAGC?GAIGCGCTTC*CC AC G CCC GGCTT 
FGLHARGVELMRFTTPGF 




LSLTFRSFDLASCDERGS 

:CTACAAarrG Ai xTiu jL i.iu:TcccAGAACGra:TU-T 

TYPPSYNLTFHSSQHVLL 



RMS 



s S 



GGRLRKAQGTFN S 



TSVVAF PTDSKTV 
CCCTGACAGCCCCTACCCCGCTCAIGCi 
PD5PYPAEA 

CGACCTGGTGACGGTGTACAACACCCtGAGCCCCATGGAGCCCCACGCC^ 

DLVTVYNTLSPMEPHALVQL?G 
CATCACACTGATAACCAACACTGAGCGGCGGCATCC ^^ 

ITLITNTBRRHPGFEATFFQIiP 
CCCCTACTACCCAGGCCACTACCCACCCAACATTGAC^^ 

PYYPGHYPPNIDCTWM IEVPNNQHVKVSFKFFYLLEPGVP 
TGCGGGCACCTSCCCCAAGGACTACGTGGAGATCAATGGGGAGA 

AGTCPKDYVEZHGEKYCGERSQFVVTSNSNEITVRF35DQ 
GTCCTACACCGACAC CGG CTTCTTAGCTGAATACCTCTCCTACGACTCCAGTGACCCft 

SYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDG 
CTGGGCCGACTGCACCGACCA CAGCGATGAG CTCAACTCCAGTTGCGA^ 1 lUW»lll'^GACAGTGTGAACGA 

V A D C T D B |s D E | l MCSCDAGHQFTCKKKFCKPirWVCDSVHD 
CTGCGGAGACAA CAGCGACGAG CAGGGGTGCAGTTGTCCGGCCCAGACCTTC^ 

C G D N |S D E j Q GCSCPAQTFRCSNGKCX.SKSQQCNGXDDCGDG 
GTCCGACGAGGCCTCCTGCCCCAAGGTGA 

|S D E | A SCPKVHVVTCTKHTYRCLNGLCLSKGHPECDGKEDC 
TAGCGACGG CTCAGATCUVG AAGGACIGCGACTCTGGGCTGCCCTC 
SVG |S D E |lC DCDCGLRSFTRQA R*V VGGTDADEGEWPWQVSI.H 

LTGACAGAGGAXTGAGGTACTCAGACCCCACGCAGTGGAC 
FRYSD P T Q K T 




TGCTCTGGGCCAGGGC 
A L G Q G H I 

CGCCTTCCTCG tX ri XjC ACGACCAGAGCCAGCGC^ r Cl ICAATGACTTCA LLl 1U« ACTAIGACATCCC 

AFLGLBDQSQRSAPGVQERRLKRIZSBPrFHDFTFOY I A 
GCTGCTqaGCTGGAGAAACCGGCAGAGTACAGCTCC^ 

I*LELEKPAEYSSMVRPICLPDASHVFPAGKAZIIVTGNGHT 
CCAGTATGGAGGGACTGGCGCGCTGAXCCTGCAAAAGG^ 

QYGGTGALZLQKGEIRVZHQTTCEHLLPQQI7PRKMCVGF 
CCTCAGCGGCGGCGTGGACTCCTtXCAGGGTGATTCCGGGGGACCCCTGTCO 

LSGGVOSCQGO (s) GGFLSSVEADGRIFQAGVVSWGOGCAQR 
GAACAAGCCAGGCGTGTACACAAGGCTCCCTCTCTTTCGGGACTGG 

NKPGVYTRLPLFRDWIKEHTGV 
CCCAGTGTGCACGCCTGCAGGCTGGAGACTGGACCGCTGA^ 
CAGCCTCCAAAGTGGAGCTGGGAGGTAGAAGGGGAGGACACTGGTGGTTCT 
CGTTTGTGTATATCTGCCTCCCCTGTCTGTAAGGAGCAGCGG 
TGAGGAAGCCCAGGCTCGGAGGACCCTGGAAAACAGACGGGTCTGAG 
TTCTTTTTAAAAAAAAAAAAAAAAAAA 
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713 
753 
793 
833 
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hTADGIS MGSDRARKGG GGPKDFGAGL KYNSRHEKVN GLEEGVEFLP VNNVKKVEKH 50 

mEpithin N-G— A SQ D — L-NM F A — A R 

hTADGIS GPGRWWLAA VLIGLLLVLL GIGFLVWHLQ YRDVRVQKVF NGYMRITNEN 100 

mEpithin — R V FSF — LS- MA-L FH ~N HL 1 

hTADGIS FVDAYENSNS TEFVSLASKV KDALKLLYSG VPFLGPYHKE SAVTAFSEGS 150 

mEpithin -L T 1 Q- -E NE — V K 

hTADGIS VIAYYWSEFS IPQHLVEEAE RVMAEERWM LPPRARSLKS FWTSWAFP 200 

mEpithin P--A--VD -A--V T A L 

hTADGIS TDSKTVQRTQ DNSCSFGLHA RGVELMRFTT PGFPDSPYPA HARCQWALRG 250 

mEpithin I-PRML A H-AAVT N V 

hTADGIS DADSVLSLTF RSFDLASCDE RGSDLVTVYN TLSPMEPHAL VQLCGTYPPS 300 

mEpithin V-P H D S V -R FS— 

hTADGIS YNLTFHSSQN VLLITLITNT ERRHPGFEAT FFQLPRMSSC GGRLRKAQGT 350 

mEpithin L F-V G L K V-SDT 

hTADGIS FNSPYYPGHY PPNIDCTWNI EVPNNQHVKV SFKFFYLLEP GVPAGTCPKD 400 

mEpithin -S N K EN R — L VD- n — V-S-T — 

hTADGIS YVEINGEKYC GERSQFWTS NSNKITVRFH SDQSYTDTGF LAEYLSYDSS 450 

mEpithin GS S S H H N 

hTADGIS DPCPGQFTCR TGRCIRKELR CDGWADCTDH SDELNCSCDA GHQFTCKNKF 500 

mEpithin M-M-K P-Y RY-R-N- T Q- 

hTADGlS CKPLFWVCDS VNDCGDNSDE QGCSCPAQTF RCSNGKCLSK SQQCNGKDDC 550 

mEpithin G E GS- K PQ — K N- 

hTADGlS GDGSDEASCP KVNWTCTKH TYRCLNGLCL SKGNPECDGK EDCSDGSDEK 600 

mEpithin D S S Y Q T 

hTADGIS DCDCGLRSFT RQARWGGTD ADEGEWPWQV SLHALGQGHI CGASLISPNW 650 

mEpithin N K N L D- 

hTADGlS LVSAAHCYID DRGFRYSDPT QWTAFLGLHD QSQRSAPGVQ ERRLKRIISH 700 

mEpithin FQ KN-K Y- M L K S LK T- 

hTADGlS PFFNDFTFDY DIALLELEKP AEYSSMVRPI CLPDASHVFP AGKAIWVTGH 750 

mEpithin -S S v TV T 

hTADGIS GHTQYGGTGA LILQKGEIRV INQTTCENLL PQQITPRMMC VGFLSGGVDS 800 

mEpithin KE D-M 

hTADGIS CQGDSGGPLS SVEADGRIFQ AGWSWGDGC AQRNKPGVYT RLPLFRDWIK 850 

mEpithin A-K M E CSSGLDQ 

hTADGIS ENTGV* 900 

mEpithin RAHWGIAAWT DSRPQTPTGM PDMHTWIQER NTDDIYAVAS PPQHNPDCEL 

hTADGIS 902 

mEpithin HP 



LOCUS HSU2042B 2900 bp mRKA ?HI 17-M--n-:597 

DEFINITION Human SNC19 mRNA sequence. 

ACCESSION U20428 

NID gl890631 

KEYWORDS 

SOURCE human. 
ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryctes; Xe:azc= ; Chorcsta; 

Vertebrata; Eutheria; Primates; Cstarrr.i.-u; Kcrvm idae; H=mo. 
REFERENCE 1 (bases 1 to 2900) 

AUTHORS Zheng, S., Cai.X., Geng,L., Cao,J., Ir.sr.s.L. ar.d ZM.'.Z. 

TITLE SNC19 gene in Kcmo sapiens 

JOURNAL Unpubl i s hed 

REFERENCE 2 (bases 1 to 2900} 

AUTHORS Zheng, S. 

TITLE Direct Submission 

JOURNAL Submitted {30- JAN- 1995) Shu Zheng, Zz'zez Ir.szitute, Ir.ejiar.g 

Medical University, Hangzhcu, 3100C r , Peoples Republic c: Cl-.ir.a 

TADG15: TCAAGAGC GGCCTCGGGGT ACCATGGGGA I- T GAT CG 3 GC C C GCAAG 3 3 CGGAGGG GG C C CG-AGGAC 7T C GG CG CGGGAC T 81 

SKC19: 

e 2 CA-.GTAC. 5 ACTCCCGGCACGAGAAAGTGAATGGCTTGGAGGAAGGCGT 3 I-.37 T CC T G Z 7AG7CAA 3.-ACG7 CAASAASG7 GGAAAAG CAT GGC CC GGGG 1 S 1 



182 CGCTGGGTGGTGCTGGCAGCCGTGCTGATCGGCCTCCTCTTGGTCTT^^ 281 
M I I I I II II IliiiinilllM Ml UHMIIIIIMMiltM: , MMMiUMIMMMMMIMMMMIUMMMMMMIM 
1 CGCTGGGTGGTGCTGGCAGCCGTGCTGATCGGCCTCCTCTTGGTCT7G:7GGGGA7CG^ 100 

•> 8 -> AGAAGGTCTT CAAT GGC7ACATGAGGATCACAAATGAGAATT T TG7GGAT G CCTACGASAACTCC^CT CCACT GAG7 T7GTAAGCCTGGCCAGCAAGGT 3 8 1 

" " I M M I 1 1 M M M II II M II M i I I I M I M I M M M M M M I : " ' M I ! Mi i ! 1 ! I 1 I I I I ! i I ! i I II I M 1 I! I I II I f I I Ml I I I I III 

101 A 3AAGGTCTTCAATGGCTACAT GAGGATC ACAAATGAGAATTTTG7GGAT 3 CC7 AC GA GAACTCCAAC T CCAC 7 GA3T i 7 G i .-.-iGCCT uG CCnGCAAGG i 200 

-82 GiAGGACGCGCTC^AGCTGCTGTACAGCGCaGTC^^ 461 

i i j I I l l ) l l I l ) I Jt M II J II I I I I I I I I I I I I I I M I I I I I I I I I I I ! I * • : ! I I I I ' . ; ! M > I ! ! 1 i I M I i M I I I I t 1 ! I I i I I I I It 
201 GAAGGAC GCGCTGAAGCTGCTGTACAG CGGAGTCCCATTCCTGGGC CC "AC CACAAG GAGT C G G 77 G7GACG G CC77 CAGCGAGG GCAGCGT CATCGCC 300 

tictactggtc^gttcagcatcccgcac^cc^^ 581 

I M I! t! I ill HI I I I i I I I I If I MiMIHIMl I Ml I II Hi. MIIMM'MII i . M I 11 M M M M M I M M M I M M M II M I 
301 TACTACTGGTCTGAGTTCAGCATCCCGCAGCACCTGGTTGAGGAGGCCGAGCGCGTCA7 GGCC . A3GAGCGCGTAGTCATGCT GCCCCCGCGGGCGCGCT 399 

582 CCCTGAAG7CCTTTGTGGTCACCTCAGTGG7GGCTTTCCCCACGGAC7CCAAA^ €81 

1 I [ i M I II I I I M M t M M I I I I I I M 1 1 M M M I M I M I M I : ' M I I I M I I M M M.M I M I M I I II I M M I M I M M I M M M M 
4 o 0 CCCTGAAGTCCTTTGTGGTCACCTCAGTGGTGGCTTTCCCCACGa^CT 2 CAAAACA.GT ACAGAGGACCCnGGACAACAGCTGCAGCTT TGGCCTGCACG . 498 

€? o ccgcggtgtggagctgatgcgcttcaccacgcccggcttccctg^a^ m 

- M I M M M It M M M It M M M i M 1 1 1 MIMMMMIMM I M ! \ M ^Mllll::il I III I I MMM I I MM I I MM 

-99 cCGCGG7GTGGAGCTaATGCGCTTC^CACG.CCGGCTTCCCTCLACA:::CCTACCC:CCTCATG2CCGCTC<CAG7GGC^ i GCGoGGACG . CGAC 592 

iqi ~C S GTGC T GAGCCTCACCTTC CGCAGCTTTGACCTTGCGT CC 7 3 2 GACGAGC GC GGCAGCGACCT GGT GACGGT GTACAACACCC7GAGCCCCAT 876 

YmmMIMI II I MMM DIM Mi < 1 1 M I M M M M I I M I I II I I I M M 1 I I I M M I M M II I II I M 

593 GCAGTGCTGAGCTACTCGAGCTGACTCGCAGC .TTGACTGCGCCT . . . . CGACGAGCGCGGCAGC3ACCTGGTGAC . GTGTA.CAACACCCTGAGCCCCAT 686 

877 GGAGCCCC-CGCCCTGGTGCAGTTGTGTGGCACC^^ 974 

IHIIIllM! MMM! M 1 I M t M M If II II I M M M : i M I M M I M II I 1 II MMM! II I I II M M II I M I M M I I M M 
6S7 GGAGCCCCACG . CCTGGTG . .AGTGTGTGGC*CCTACCCTCCCTCCTA-«^^ 7B3 

975 cCAACACTGA<KGGC(SCATCCCGGCm^ 1074 

I 1 | | || | | | | | | | j | | | | | M I M II M M II I I M I 11 M M I 1 1 . ! M M I M II M I M M II I I II I M M M I M II I II M I M M I M M 
784 CCAACACTGA. . CGCKCATCCCGGCTTTt^CCAOT 881 

1075 c^C^GCCCCTACTACCCa.GGCCACTACCCACCCAA^ 1174 
I Ml II II M M M M M II M II II 11 M II II M II II M M II M : • 1 I M I I ! I M M M M I ! II II II M M II M I M I I I II I I I III I I 
692 CAACAGC CCCTACT ACCCAGGCCACTAC CCACCCAACAT TGAC7GCACA7 GGAAAA7 7 GAGGT G CCCA.-.CAACCAGCATGT GAA^G i wCGCTTCA«ATT C 981 

1175 T 7 C TAC CTGCTGGAGCCCGGCGTGC CTGCGGGCACCT GC C CCAAG GAC 7 ACGT GGAGA7 CAAT GG 3 GAGAAA7A.CT GCGGAIl^CAGGT C CCAGT T CGJCG 1274 
I M M ! M M I M M II II M I M M M M M I i I M II I II I I M i : M II i ! M ! It II I I 1 1 I 1 I M M I I I I M M M I M M ! II I M II I MM 
932 ~7CTACC7GCTGGAGCCCGGCGTGCCTGCGGGCACCTGCCCCAAGGAC7ACG7GGAGA7CAATGG^ 1081 

1275 T^cc^-nC^GCAACAA^TCACAGT^^ 1374 

1 j f| | m | | | | i I | m t j M I I II I I I I M M I I! M M 1 M M I M I : I I I I I I I i I I I I I I i !! I I I I I I I I I I I I I 1 I I I I I I IN I I I M 1 1 1 1 I 
1082 tokoIg^ 1151 

1 375 CCCATGCCCGGGGCAGTTCACGTGCCGCACGGGGCGGTGTATCCGGAAGGA 1 474 

, im ! H*! M M M M M M M M II M II I M I M ! I I M ' i ! I I M II i M I I M ! ! I M M I I M M M I M M II M M M M M I M 

1192 cccatgcCCGGGGCAGTTCACGTGCCGCACGGGGCGGTGTATC^ 12S0 

14 75 1 1 CTGCAGTTGCGACGCCGGCCACCAGTTCACGTGCAAGAACAAGTT C7 3 7AAGCC C C 7 CTT CTGGGTCTGC GACAG7 GT GAACGACT GCGGAGACAACA 1574 

',7| M M II M M I II M II I M M I It I I M M M II M I II M M : ! M II I { M I M M M M M II I M M M M M I I M I! II M M II 
, 2g , i ** CTGC L G T TGCGACGCCGGCCA.CCAGTTCACGTGCAAGAGCAAGT737 3CAAG . . . CTCTTC7G3GTCTGCGACAG7GTGAACGAGT GCGGAGACAACA 1377 



1575 GCGACGAGCAGGGG7GCAG77G7CCGG • CCCAGACCT7CAGGTGT7CCAATG 33^GTGCCTCTCGVJ«3uCAGCr.G i ^ 1673 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n i n n i m I n i m 1 1 1 1 1 1 1 1 1 1 1 1 1 ! i m m n f i i ; m i n 1 1 n i n u i n i i m m i i i i i i i 

13*78 GCGACGAGCAGGG77GCA777G7CCGGACCCAGACC77CAGG7G77CCAn7GGG^ A4 ' 

1674 GGGACGGG?CCGAC(MGCC7CCTGCCCCAAGG7GAra 1773 

1478 GG^CGGG7CCGAC^ j GAGCAAGGGCAA 1577 

1774 CCCTGAGTGTGACGGGAAGSAGCaCTGTAGCG^^ 1873 

i h i m 1 1 1 it 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 1 1 1 1 1 1 1 M 1 1 i i 1 1 1 1 1 1 1 1 1 1 1 u m n ! 1 1 m ii 1 1 1 n m iiiNinmiMimii 

1578 CCC7anG7G7^ 1677 
1674 GGC^CGGATGCGGATGAGGGCGnGTGGCCCTGGCAGGTAAGCC^ 1973 

1 1 1 1 1 1 m 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 n 1 1 1 1 1 u 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii i n u 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 ii i 'i'iii'iM I I I I I I I I I I 

1678 GGCACGGA7GCGGA7GAGGGCGAG7GGCCC7GGCAGG7A' i .GCC7GCA7GC7C7GGGCCAGGGCC t .CA7C7GCGG7GC77CCu i C-.i v. i C i CCCAACTGGC 1777 

1 974 T GG7CTC7GCCGCACAC7GC7ACA7CGA7GACAGAGGAT7CAGG7ACTCAG.-.CCCCACGCAG7GG^ 207 3 

J | | | | ! | J I 1 | M I I I I I I II I I I I i I I I I I I I II I I I II I 1 I III J I I I I I I I I 1 ! I I M I I i I I I I ! I I I I H I I 1 H I i I M II III I I I I I I I I 
1779 TGGTC7CTGCCGCACAC7GC7ACA7Cawa*CA6A^^ lt,7; > 

2074 C£GCGCCCC?GGGG7GCAG£-.GCGCAGGC7CA?.GCGCATCA7CT^^ 2173 

\\ I | | { j | | | | I I I I I I I I I I I I 1 I I I I I I II I I I I I I 1 I I I I I I M I I I I I II I I I I I I I I I M I I I II I I I I I I ! II I I I I ! I I II I I I I I I I M 
1876 CA. .GGCCC7GGGG7GCAG^.GCGCAGGC7CAnGCGCATCA7C?CCCACCCC77CTT 1973 

*?174 ©Ga"CCGSCA(M7ACSGC7CCA7GG7GCGGCC^ 2273 

I j I f| { t | | | I | t 1 It 1 I t M I I I I 1 I I I I I I I I I I I I I t I I I I t I I I I I I 1 i H I I I i 1 I I I I i I I HI I I I H I ! f : I J M J i i I I I I I I I I I I II I 
1974 ACCGGCAGAG7ACAGC7CCA7GG7GCGGCCCA7C7GCC7GCCGGACGCC7GCCA7G7C77CCC7GCCGGC 1 AGG7CA7C7GGG i CACGGGC : GoG 2073 

0074 aACACACCC-G7A7GanGGCAC7GGCGCGC7aVTCC7GC 2373 

I I 1 I I ) j [ I , I I ! I I j 1 I ! ! 1 I I! I I t I I i I [ I M 1 I I I 1 i I i ! M I I I I I ! I I I I I I I ) I i I I I I i I I I ! i I I I I I M I I I I i I I I I I 1 I I I I I I M I I 

2074 (MACACCC-STATGGAGGCACTGGCGCGCTCM^^ 2173 
2374 CACGCCGCGCA7GA7G7GCG7GGGC77CC7CAGCGGCGGCG7GG^ 2473 

II «•'«■• 

2174 CACGCCGCGCA 

->474 A7C77CCAGGCCGG7G7GG7GAGC7GGGGAGACGGC7GCGC7C-^ 2573 

H j| 1 I I I I 1 III I I I I 11 I I I I I I I I I I I I I 1 I I I I 1 I 1 ! I I I 1 I I II I I ! I I 1 I I I I I I I I I 1 1! I I ! 1 I I I I 1 i i IM II I I I I I I I I I I 

2274 ATC77CCAGGCCGG7GTGS7GAGC7GGGGASAC.GCTGCGC7^ 2372 

2574 j^aac*CTGG^TATAGGGGCCGGGGCCACCCWATGTGTACAOT * ■ 2670 

rit~iitiiiiiiiititviiJtiirffiftiiiiiitiiiiiiiiiiiiiitiiiftittiiiiiiiiiiiiiii>>i'*ii> lllf11111111 'I' 

2373 ajsaj^CTGGGGTATAC^GGCCGGGGCCACCCAAATGTGT^^ 2472 
2 67 1 GGnCCGC7GAC7GCACCAGCGCCCCCAGPA.CA7ACAC7G7a^ 2770 

1 nil iiiitiittii 111111111111111111111111 111111111 muni iiiiiiiiiiiiiiiiimiiinitiiii 

2473 GCACCG7GACC7GCACCAGCG . CCCCAGAACATACACTG7GAAC7C . A7C7CCAGG . . C7CAAA7C7G . C7AGAAAACC7C7CGC77CC7CAGCC 1 CCAA ,:6- 

2771 AG7GGAGC7GGGA . GGTAGAAGGGGAGG . ACAC7GG7GG77C7AC7^XCCAAX7GGGGGCAPAGG777a-Aa^ CACAGCC7CCCCCGCCAGCCCCAAGC 2 3 6e 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 1 I 1 1 U I I I I I I I I I I I 1 I I I Mill I I I I II I I I I I 
2566 AG7GGAGC7 GGGAG GG7AG 2 AG G GGAGGAACAC7 GG7G G 7 7 C 7AC7GAC CCAAC7 GGGG . .CAAGG777SAAG.CACAS. . . . CTCCGGCAGCCC . .AAG 265& 

2869 7GGGCCGAGGCGCG777G7G7ATATCTGeCTCCCCTCT^ 2968 

Mill I lUIIIMM III I I I I I I M I I I HI I I Ml IL.-™ r'rrr iCTrrrA 21^ 
2659 TGGGCGAGGACGCGTTTG7GCATA.CTGCC. CTGCTC7A7ACACGGAAGACC7SGA TC*Ci«G.oA Gi G7GAC7GCCGG 273- 

2969 A7C7GGGC-G-GGGGCCC77GGGCCACGC7C77GnGGAAGCCCAGGC^^ 3 <>68 

,11111 " [ 1 ] 1 hi mi I IM1UIIII II II I II IHUtlll II I II II I II 111111111111111(111 II II MIIIIIMI 

2736 A7C7GG. . ,CTGTGGTCC77GGCCMGCreCTTGAGGAAGCCCAGGCT^ 2 8 32 

3069 CCCAGGG7 GGAC77 CAG7 G7G7 G7 A7 77G7G7AAATGGG7A i .AACAA7 77A7 7 7C77777AAAVAAAAAAAAAV-AAA 314 7 

mm iiMiiiinitiiii iiiiiiiini mum 1 1 1 1 1 1 n 1 1 m 1 1 1 1 1 1 1 it 1 1 

2833 CCCAGG. . 7GAC77CAG7G7G7G7A . 77G7G7AAA7GAG7AA a AC^7777A777C77777AvAAA--AA-A 2500 



3CCGCGCATti.nl I* Hjcu i oot*u 1 i v.u i wvuluvjlu^u i cu.-.l : <-l i ut^-vwv: 1 i i\.w«v;*-^>.wiv»w ta . — 

I I I II M I I I I I I I I I I I I I I I 1 I 1 I I I M 1 I I I I I I i M ! I I I I I H I ! I I I I I I I 1 I I I I I I I I I M I I I M ! I I I I I I I i I I I I I I I I I I I I 1 1 
;CCGCGCAT(*TGTCOTGGGC77CC7CAGCGGC^ 2273 
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hereby state that the content of the paper copy of the Sequence Listing 
filed as part of the above-captioned application and the enclosed 
computer readable copy of the Sequence Listing are the same. 



Respectfully submitted, 



Date- 




Counsel for Applicant 
Registration No. 35,423 

McGregor & adler, llp 

8011 Candle Lane 
Houston, Texas 77071 
(713) 777-2321 



SEQUENCE LISTING 



<110> O'Brien, Timothy J. 

<120> TADG-15: An Extracellular Serine Protease 

Overexpressed in Carcinomas 

<130> D6064CIP 

<141> 10-20-1999 

<150> 09/027,337 

<151> 02-20-1998 

<160> 98 

<170> WORD 6.0.1 for Macintosh 



<210> 1 

<211> 3147 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> TADG-15 
<400> 1 

tcaagagcgg cctcggggta ccatggggag 
gaaggacttc ggcgcgggac tcaagtacaa 
ggaaggcgtg gagttcctgc cagtcaacaa 
gcgctgggtg gtgctggcag ccgtgctgat 
cttcctggtg tggcatttgc agtaccggga 
catgaggatc acaaatgaga attttgtgga 
tgtaagcctg gccagcaagg tgaaggacgc 
cctgggcccc taccacaagg agtcggctgt 
ctactactgg tctgagttca gcatcccgca 
ggccgaggag cgcgtagtca tgctgccccc 
cacctcagtg gtggctttcc ccacggactc 
ctgcagcttt ggcctgcacg cccgcggtgt 
ccctgacagc ccctaccccg ctcatgcccg 
ctcagtgctg agcctcacct tccgcagctt 
cgacctggtg acggtgtaca acaccctgag 
gtgtggcacc taccctccct cctacaacct 
catcacactg ataaccaaca ctgagcggcg 
gctgcctagg atgagcagct gtggaggccg 
cccctactac ccaggccact acccacccaa 
caacaaccag catgtgaagg tgagcttcaa 
tgcgggcacc tgccccaagg actacgtgga 
gtcccagttc gtcgtcacca gcaacagcaa 
gtcctacacc gacaccggct tcttagctga 
cccggggcag ttcacgtgcc gcacggggcg 
ctgggccgac tgcaccgacc acagcgatga 



cgatcgggcc cgcaagggcg gagggggccc 60 
ctcccggcac gagaaagtga atggcttgga 120 
cgtcaagaag gtggaaaagc atggcccggg 180 
cggcctcctc ttggtcttgc tggggatcgg 240 
cgtgcgtgtc cagaaggtct tcaatggcta 300 
tgcctacgag aactccaact ccactgagtt 360 
gctgaagctg ctgtacagcg gagtcccatt 420 
gacggccttc agcgagggca gcgtcatcgc 480 
gcacctggtg gaggaggccg agcgcgtcat 540 
gcgggcgcgc tccctgaagt cctttgtggt 600 
caaaacagta cagaggaccc aggacaacag 660 
ggagctgatg cgcttcacca cgcccggctt 720 
ctgccagtgg gccctgcggg gggacgccga 780 
tgaccttgcg tcctgcgacg agcgcggcag 840 
ccccatggag ccccacgccc tggtgcagtt 900 
gaccttccac tcctcccaga acgtcctgct 960 
gcatcccggc tttgaggcca ccttcttcca 1020 
cttacgtaaa gcccagggga cattcaacag 1080 
cattgactgc acatggaaca ttgaggtgcc 1140 
attcttctac ctgctggagc ccggcgtgcc 1200 
gatcaatggg gagaaatact gcggagagag 1260 
caagatcaca gttcgcttcc actcagatca 1320 
atacctctcc tacgactcca gtgacccatg 1380 
gtgtatccgg aaggagctgc gctgtgatgg 1440 
gctcaactgc agttgcgacg ccggccacca 1500 



SEQ-1 



gttcacgtgc aagaacaagt tctgcaagcc 
ctgcggagac aacagcgacg agcaggggtg 
caatgggaag tgcctctcga aaagccagca 
gtccgacgag gcctcctgcc ccaaggtgaa 
ctgcctcaat gggctctgct tgagcaaggg 
tagcgacggc tcagatgaga aggactgcga 
tcgtgttgtt gggggcacgg atgcggatga 
tgctctgggc cagggccaca tctgcggtgc 
tgccgcacac tgctacatcg atgacagagg 
ggccttcctg ggcttgcacg accagagcca 
gctcaagcgc atcatctccc accccttctt 
gctgctggag ctggagaaac cggcagagta 
ggacgcctcc catgtcttcc ctgccggcaa 
ccagtatgga ggcactggcg cgctgatcct 
gaccacctgc gagaacctcc tgccgcagca 
cctcagcggc ggcgtggact cctgccaggg 
ggcggatggg cggatcttcc aggccggtgt 
gaacaagcca ggcgtgtaca caaggctccc 
tggggtatag gggccggggc cacccaaatg 
cccagtgtgc acgcctgcag gctggagact 
catacactgt gaactcaatc tccagggctc 
cagcctccaa agtggagctg ggaggtagaa 
actgggggca aaggtttgaa gacacagcct 
cgtttgtgta tatctgcctc ccctgtctgt 
tcctcagtga aggtggtggg gctgccggat 
tgaggaagcc caggctcgga ggaccctgga 
taccagctcc cagggtggac ttcagtgtgt 
ttctttttaa aaaaaaaaaa aaaaaaa 



cctcttctgg gtctgcgaca gtgtgaacga 1560 
cagttgtccg gcccagacct tcaggtgttc 1620 
gtgcaatggg aaggacgact gtggggacgg 1680 
cgtcgtcact tgtaccaaac acacctaccg 1740 
caaccctgag tgtgacggga aggaggactg 1800 
ctgtgggctg cggtcattca cgagacaggc 1860 
gggcgagtgg ccctggcagg taagcctgca 1920 
ttccctcatc tctcccaact ggctggtctc 1980 
attcaggtac tcagacccca cgcagtggac 2040 
gcgcagcgcc cctggggtgc aggagcgcag 2100 
caatgacttc accttcgact atgacatcgc 2160 
cagctccatg gtgcggccca tctgcctgcc 2220 
ggccatctgg gtcacgggct ggggacacac 2280 
gcaaaagggt gagatccgcg tcatcaacca 2340 
gatcacgccg cgcatgatgt gcgtgggctt 2400 
tgattccggg ggacccctgt ccagcgtgga 2460 
ggtgagctgg ggagacggct gcgctcagag 2520 
tctgtttcgg gactggatca aagagaacac 2580 
tgtacacctg cggggccacc catcgtccac 2640 
ggaccgctga ctgcaccagc gcccccagaa 2700 
caaatctgcc tagaaaacct ctcgcttcct 27 60 
ggggaggaca ctggtggttc tactgaccca 2820 
cccccgccag ccccaagctg ggccgaggcg 2880 
aaggagcagc gggaacggag cttcggagcc 2940 
ctgggctgtg gggcccttgg gccacgctct 3000 
aaacagacgg gtctgagact gaaattgttt 3060 
gtatttgtgt aaatgggtaa aacaatttat 3120 

3147 



<210> 2 

<211> 855 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> TADG-15 
<400> 2 

Met Gly Ser Asp Arg Ala Arg Lys Gly Gly Gly Gly Pro Lys Asp 

5 10 15 

Phe Gly Ala Gly Leu Lys Tyr Asn Ser Arg His Glu Lys Val Asn 

20 25 30 

Gly Leu Glu Glu Gly Val Glu Phe Leu Pro Val Asr. Asn Val Lys 

35 40 45 

Lys Val Glu Lys His Gly Pro Gly Arg Trp Val Val Leu Ala Ala 

50 55 60 

Val Leu He Gly Leu Leu Leu Val Leu Leu Gly He Gly Phe Leu 

65 70 75 

Val Trp His Leu Gin Tyr Arg Asp Val Arg Val Gin Lys Val Phe 

80 85 90 

Asn Glv Tyr Met Arg He Thr Asn Glu Asn Phe Val Asp Ala Tyr 

95 100 105 

SEQ-2 



Glu 


Asn 


Ser 


Asn 


Ser 
110 


Thr 


Glu 


Lys 


Asp 


Ala 


Leu 


Lys 
125 


Leu 


Leu 


Pro 


Tyr 


His 


Lys 


Glu 
140 


Ser 


Ala 


Val 


He 


Ala 


Tyr 


Tyr 
155 


Trp 


Ser 


Val 


Glu 


Glu 


Ala 


Glu 
170 


Arg 


Val 


Leu 


Pro 


Pro 


Arg 


Ala 
185 


Arg 


Ser 


Val 


Val 


Ala 


Phe 


Pro 
200 


Thr 


Asp 


Asp 


Asn 


Ser 


Cys 


Ser 
215 


Phe 


Gly 


Met 


Arg 


Phe 


Thr 


Thr 
230 


Pro 


Gly 


His 


Ala 


Arg 


Cys 


Gin 
245 


Trp 


Ala 


Leu 


Ser 


Leu 


Thr 


Phe 
260 


Arg 


Ser 


Arg 


Gly 


Ser 


Asp 


Leu 
275 


Val 


Thr 


Glu 


Pro 


His 


Ala 


Leu 
290 


Val 


Gin 


Tyr 


Asn 


Leu 


Thr 


Phe 


His 


Ser 








305 






Leu 


He 


Thr 


Asn 


Thr 
320 


Glu 


Arg 


Phe 


Phe 


Gin 


Leu 


Pro 
335 


Arg 


Met 


Lys 


Ala 


Gin 


Gly 


Thr 
350 


Phe 


Asn 


Pro 


Pro 


Asn 


He 


Asp 
365 


Cys 


Thr 


Gin 


His 


Val 


Lys 


Val 
380 


Ser 


Phe 


Gly 


Val 


Pro 


Ala 


Gly 
395 


Thr 


Cys 


Gly 


Glu 


Lys 


Tyr 


Cys 
410 


Gly 


Glu 


Asn 


Ser 


Asn 


Lys 


He 
425 


Thr 


Val 


Thr 


Asp 


Thr 


Gly 


Phe 
440 


Leu 


Ala 


Asp 


Pro 


Cys 


Pro 


Gly 
455 


Gin 


Phe 


Arg 


Lys 


Glu 


Leu 


Arg 
470 


Cys 


Asp 


Ser 


Asp 


Glu 


Leu 


Asn 
485 


Cys 


Ser 


Cys 


Lys 


Asn 


Lys 


Phe 
500 


Cys 


Lys 



Phe 


Val 


Ser 


Leu 


Ala 


Ser 


Lys 


Val 






115 










120 


Tyr 


Ser 


Gly 


Val 


Pro 


Phe 


Leu Gly 






130 










135 


Val 


Thr 


Ala 


Phe 


Ser 


Glu 


Gly 


Ser 






145 










150 


Glu 


Phe 


Ser 


He 


Pro 


Gin 


His 


Leu 






160 










165 


Met 


Ala 


Glu 


Glu 


Arg 


Val 


Val 


Met 






175 










180 


Leu 


Lys 


Ser 


Phe 


Val 


Val 


Thr 


Ser 






190 










195 


Ser 


Lys 


Thr 


Val 


Gin 


Arg 


Thr 


Gin 






205 










210 


Leu 


His 


Ala 


Arg Gly 


Val 


Glu 


Leu 






220 










225 


Phe 


Pro 


Asp 


Ser 


Pro 


Tyr 


Pro 


Ala 






235 










240 


Leu 


Arg 


Gly 


Asp 


Ala 


Asp 


Ser 


Val 






250 










255 


Phe 


Asp 


Leu 


Ala 


Ser 


Cys 


Asp 


Glu 






265 










270 


Val 


Tyr 


Asn 


Thr 


Leu 


Ser 


Pro 


Met 




280 










285 


Leu 


Cys 


Gly 


Thr 


Tyr 


Pro 


Pro 


Ser 






295 










300 


Ser 


Gin 


Asn 


Val 


Leu 


Leu 


He 


Thr 






310 










315 


Arg 


His 


Pro 


Gly 


Phe 


Glu 


Ala 


Thr 






325 










330 


Ser 


Ser 


Cys 


Gly Gly 


Arg 


Leu Arg 






340 










345 


Ser 


Pro 


Tyr 


Tyr 


Pro 


Gly 


His 


Tyr 






355 










360 


Trp 


Asn 


He 


Glu 


Val 


Pro 


Asn 


Asn 




370 










375 


Lys 


Phe 


Phe 


Tyr 


Leu 


Leu 


Glu 


Pro 




385 










390 


Pro 


Lys 


Asp 


Tyr Val 


Glu 


He 


Asn 






400 










405 


Arg 


Ser 


Gin 


Phe 


Val 


Val 


Thr 


Ser 




415 










420 


Arg 


Phe 


His 


Ser 


Asp 


Gin 


Ser 


Tyr 






430 










435 


Glu 


Tyr 


Leu 


Ser 


Tyr 


Asp 


Ser 


Ser 






445 










450 


Thr 


Cys 


Arg 


Thr 


Gly 


Arg 


Cys 


He 






460 










465 


Gly Trp 


Ala 


Asp 


Cys 


Thr 


Asp 


His 






475 










480 


Cys 


Asp 


Ala 


Gly His 


Gin 


Phe 


Thr 






490 










495 


Pro 


Leu 


Phe 


Trp 


Val 


Cys 


Asp 


Ser 



505 510 

SEQ-3 



Val 


Asn 


Asp 


Pro 


Ala 


Gin 


Ser 


Gin 


Gin 


Glu 


Ala 


Ser 


Thr 


Tyr 


Arg 


Glu 


Cys 


Asp 


Asp 


Cys 


Asp 


Val 


Gly 


Gly 


Ser 


Leu 


His 


He 


Ser 


Pro 


Asp 


Arg 


Gly 


Leu 


Gly 


Leu 


Glu 


Arg 


Arg 


Phe 


Thr 


Phe 


Ala 


Glu 


Tyr 


Ser 


His 


Val 


Gly 


His 


Thr 


Gly 


Glu 


He 


Pro 


Gin 


Gin 


Gly 


Gly 


Val 


Ser 


Val 


Glu 


Trp 


Gly 


Asp 


Arg 


Leu 


Pro 



<210> 3 

<211> 256 

<212> PRT 

<213> Homo 



Cys 


Gly 


Asp 




blb 




Thr 


Phe 


Arg 




530 




Cys 


Asn 


Gly 




i — yt | — 

545 




Cys 


Pro 


Lys 




560 




Cys 


Leu 


Asn 




575 




Gly 


Lys 


Glu 




590 




Cys 


Gly 


Leu 




605 




Thr 


Asp 


Ala 




620 




Ala 


Leu 


Gly 




635 




Asn 


Trp 


Leu 




650 




Phe 


Arg 


Tyr 




f f r~ 

665 




His 


Asp 


Gin 




680 




Leu 


Lys 


Arg 




695 




Asp 


Tyr 


Asp 




710 




Ser 


Ser 


Met 




725 




Phe 


Pro 


Ala 




740 




Gin 


Tyr 


Gly 




755 




Arg 


Val 


He 




770 




He 


Thr 


Pro 




785 




Asp 


Ser 


Cys 




800 




Ala 


Asp 


Gly 




815 




Gly 


Cys 


Ala 




830 




Leu 


Phe 


Arg 




845 





sapiens 



Asn 


Ser 


Asp 


Cys 


Ser 


Asn 


Lys 


Asp 


Asp 


Val 


Asn 


Val 


Gly 


Leu 


Cys 


Asp 


Cys 


Ser 


Arg 


Ser 


Phe 


Asp 


Glu 


Gly 


Gin 


Gly 


His 


Val 


Ser 


Ala 


Ser 


Asp 


Pro 


Ser 


Gin 


Arg 


He 


He 


Ser 


He 


Ala 


Leu 


Val 


Arg 


Pro 


Gly 


Lys 


Ala 


Gly 


Thr 


Gly 


Asn 


Gin 


Thr 


Arg 


Met 


Met 


Gin 


Gly 


Asp 


Arg 


He 


Phe 


Gin 


Arg 


Asn 


Asp 


Trp 


He 



Glu 


Gin 


Gly 


C O A 

520 






Gly 


Lys 


Cys 


535 






Cys 


Gly 


Asp 


rrn 

550 






Val 


Thr 


Cys 


565 






Leu 


Ser 


Lys 


580 






Asp 


Gly 


Ser 


r~ /->, r— 

595 






Thr 


Arg 


Gin 


610 






Glu 


Trp 


Pro 


625 






He 


Cys 


Gly 


640 






Ala 


His 


Cys 


655 






Thr 


Gin 


Trp 


670 






Ser 


Ala 


Pro 


685 






His 


Pro 


Phe 


700 






Leu 


Glu 


Leu 


715 






He 


Cys 


Leu 


730 






He 


Trp 


Val 


745 






Ala 


Leu 


He 


760 






Thr 


Cys 


Glu 


775 






Cys 


Val 


Gly 


790 






Ser 


Gly 


Gly 


805 






Gin 


Ala 


Gly 


820 






Lys 


Pro 


Gly 


835 






Lys 


Glu 


Asn 


850 







Cys 


Ser 


Cys 






525 


Leu 


Ser 


Lys 






540 


Gly 


Ser 


Asp 






555 


Thr 


Lys 


His 






570 


Gly 


Asn 


Pro 






585 


Asp 


Glu 


Lys 






600 


Ala 


Arg 


Val 






615 


Trp 


Gin 


Val 






630 


Ala 


Ser 


Leu 






645 


Tyr 


He 


Asp 






660 


Thr 


Ala 


Phe 






675 


Gly 


Val 


Gin 






690 


Phe 


Asn 


Asp 






705 


Glu 


Lys 


Pro 






720 


Pro 


Asp 


Ala 






735 


Thr 


Gly 


Trp 






750 


Leu 


Gin 


Lys 






765 


Asn 


Leu 


Leu 






780 


Phe 


Leu 


Ser 






795 


Pro 


Leu 


Ser 






810 


Val 


Val 


Ser 






825 


Val 


Tyr 


Thr 






840 


Thr 


Gly 


Val 






855 



SEQ 



i-4 



<220> 



<223> Heps in 
<400> 3 



Arg lie 


Val 


Gly Gly 


Arg Asp 


Thr 


Ser 


Leu 


Gly Arg Trp 


Pro 


Trp 








5 








10 






15 


Gin Val 


Ser 


Leu Arg 


Tyr Asp 


Gly 


Ala 


His 


Leu Cys Gly 


Gly 


Ser 








20 








25 






30 


Leu Leu 


Ser 


Gly Asp 


Trp Val 


Leu 


Thr 


Ala 


Ala His Cys 


Phe 


Pro 








35 








40 






45 


Glu Arg 


Asn 


Arg Val 


Leu Ser 


Arg 


Trp 


Arg 


Val Phe Ala 


Gly 


Ala 








50 








55 






60 


Val Ala 


Gin 


Ala 


Ser 


Pro His 


Gly 


Leu 


Gin 


Leu Gly Val 


Gin 


Ala 








65 








70 






75 


Val Val 


Tvr 


His 


Glv 


Gly Tyr 


Leu 


Pro 


Phe 


Arg Asp Pro 


Asn 


Ser 








80 








85 






90 


Glu Glu 


Asn 


Ser 


Asn 


Asp He 


Ala 


Leu 


Val 


His Leu Ser 


Ser 


Pro 








95 








100 






105 


TiPtl Pto 

j_i Vw y*A. J- .1- Vw/ 


Leu 


Thr* 


rii ii 

\J J_ Li 


Tyr He 


Gin 


Pro 


Val 


Cys Leu Pro 


Ala 


Ala 








110 








115 






120 


Glv Gin 


Ala 


Leu 


Val 


Asp Gly 


Lys 


He 


Cys 


Thr Val Thr 


Gly 


Trp 








125 








130 






135 


Glv Asn 


Thr 


Gin 


Tyr 


Tvr Glv 


Gin 


Gin 


Ala 


Gly Val Leu 


Gin 


Glu 






140 








145 






150 


Ala Arg 


Val 


Pro 


lie 


He Ser 


Asn 


Asp 


Val 


Cys Asn Gly 


Ala 


Asp 






155 








160 






165 


Phe Tyr 


Glv 


Asn 


Gin 


He Lys 


Pro 


Lvs 


Met 


Phe Cys Ala 


Gly 


Tyr 








x / u 








175 






180 


Pro Glu 


Gly 


Gly 


He 


Asp Ala 


Cys 


Gin 


Gly 


Asp Ser Gly 


Gly 


Pro 








185 








x y u 






_L _/ -J 


Phe Val 


Cys 


Glu 


Asp 


Ser He 


Ser 


Arg 


Thr 


Pro Arg Trp 


Arg 


Leu 








200 








9 OR 

ZUJ 






91 n 

/j J_ u 


Cys Gly 


He 


Val 


Ser 


Trp Gly 


Thr 


Gly 


Cys 


Ala Leu Ala 


Gin 


Lys 








215 








220 






225 


irro O-Ly 


v ax 


Tyr 


Thr 


j-iyib vdx 


Cor 
OCX 


7\ on 


XT lit! 


Arvr nln H^tts 




JTllv^ 








230 








235 






240 


Gin Ala 


He 


Lys 


Thr 


His Ser 


Glu 


Ala 


Ser 


Gly Met Val 


Thr 


Gin 








245 








ZDU 






9 ^ 
A O D 


Leu 






















<210> - 


4 




















<211> : 


225 




















<212> : 


PRT 




















<213> ffomo 


sapiens 
















<220> 






















<223> : 


SCCE 




















<400> - 


4 




















Lys lie 


He 


Asp 


Gly 


Ala Pro 


Cys 


Ala 


Arg 


Gly Ser His 


Pro 


Trp 








5 








10 






15 



SEQ-5 



Gin 


Val 


Ala 


Leu 


Leu 


Ser Gly 










20 






Leu 


Val 


Asn 


Glu 


Arg 


Trp Val 










35 






Asn 


Glu 


Tyr 


Thr 


Val 
50 


His 


Leu 


Arg 


Ala 


Gin Arg 


lie 


Lys 


Ala 










65 






Tyr 


Ser 


Thr 


Gin 


Thr 
80 


His 


Val 


Asn 


Ser 


Gin 


Ala 


Arg 
95 


Leu 


Ser 


Pro 


Ser 


Arg 


Cys 


Glu 
110 


Pro 


Pro 


Trp 


Gly 


Thr 


Thr 


Thr 
125 


Ser 


Pro 


Met 


Cys 


Val 


Asp 


Val 
140 


Lys 


Leu 


Val 


Tyr 


Lys 


Asp 


Leu 
155 


Leu 


Glu 


Pro 


Asp 


Ser 


Lys 


Lys 
170 


Asn 


Ala 


Leu 


Val 


Cys 


Arg 


Gly 
185 


Thr 


Leu 


Phe 


Pro 


Cys 


Gly 


Gin 
200 


Pro 


Asn 


Cys 


Lys 


Phe 


Thr 


Lys 
215 


Trp 


He 



<210> 5 

<211> 225 

<212> PRT 

<213> Homo sapiens 

<220> 



<223> Trypsin 
<400> 5 



Lys 


He 


Val 


Gly 


Gly 
5 


Tyr Asn 


Gin 


Val 


Ser 


Leu 


Asn 


Ser Gly 










20 




He 


Asn 


Glu 


Gin 


Trp 


Val Val 










35 




Arg 


He 


Gin 


Val 


Arg 


Leu Gly 










50 




Gly Asn 


Glu 


Gin 


Phe 


He Asn 










65 




Gin 


Tyr 


Asp 


Arg 


Lys 


Thr Leu 










80 




Leu 


Ser 


Ser 


Arg 


Ala 


Val He 










95 





Asn 


Gin 


Leu 


His 


Cys 


Gly 


Gly 


Val 






O IT" 

25 










3 0 


Leu 


Thr 


TV ~l 

Ala 


TV "1 

Ala 


His 


Cys 


Lys 


Met 






40 










45 


Gly 


Ser 


Asp 


Thr 


Leu 


Gly 


Asp 


Arg 






55 










60 


Ser 


Lys 


Ser 


Phe 


Arg 


His 


Pro 


Gly 






70 










75 


Asn 


Asp 


Leu 


Met 


Leu 


Val 


Lys 


Leu 






85 










90 


Ser 


Met 


Val 


Lys 


Lys 


Val 


Arg 


Leu 






100 










105 


Gly Thr 


Thr 


Cys 


Thr 


Val 


Ser 


Gly 






115 










120 


Asp Val 


Thr 


Phe 


Pro 


Ser 


Asp 


Leu 






130 










1 3d 


He 


Ser 


Pro 


Gin 


Asp 


Cys 


ml 

Thr 


Lys 






145 










150 


Asn 


Ser 


Met 


Leu 


Cys 


Ala 


Gly 


He 






160 










165 


Cys 


Asn 


Gly 


Asp 


Ser 


Gly 


Gly 


Pro 






175 










180 


Gin 


Gly 


Leu 


Val 


Ser 


Trp 


Gly 


Thr 






190 










195 


Asp 


Pro 


Gly 


Val 


Tyr 


Thr 


Gin 


Val 






205 










210 


Asn Asp 


Thr 


Met 


Lys 


Lys 


His 


Arg 






220 










225 


Cys 


Glu 


Glu 


Asn 


Ser 


Val 


Pro 


Tyr 






10 










15 


Tyr 


His 


Phe 


Cys 


Gly 


Gly 


Ser 


Leu 






25 










30 


Ser 


Ala 


Gly 


His 


Cys 


Tyr 


Lys 


Ser 






40 










45 


Glu 


His 


Asn 


He 


Glu 


Val 


Leu 


Glu 






55 










60 


Ala 


Ala 


Lys 


He 


He 


Arg 


His 


Pro 






70 










75 


Asn 


Asn 


Asp 


He 


Met 


Leu 


He 


Lys 






85 










90 


Asn 


Ala 


Arg 


Val 


Ser 


Thr 


He 


Ser 






100 










105 



SEQ-6 



Leu Pro Thr 


Ala 


Pro 


Pro Ala Thr Gly 


Thr 


Lys 


Cys Leu 


He Ser 






110 








115 






l^U 


Gly Trp Gly Asn 


Thr 


Ala 


Ser 


Ser Gly 


Ala 


Asp 


Tyr Pro 


Asp Glu 






125 








130 






135 


Leu Gin Cys 


Leu 


Asp 


Ala 


Pro 


Val Leu 


Ser 


Gin 


Ala Lys 


Cys Glu 






140 








145 






150 


Ala Ser Tyr 


Pro 


Gly 


Lys 


He 


Thr Ser 


Asn 


Met 


Phe Cys 


Val Gly 






155 








160 






165 


Phe Leu Glu 


Gly 


Gly 


Lys 


Asp 


Ser Cys 


Gin 


Gly Asp Ser Gly Gly 






170 








•1 rjr 

175 






180 


Pro Val Val 


Cys 


Asn 


Gly Gin Leu Gin 


Gly 


Val 


Val Ser Trp Gly 






185 








190 






195 


Asp Gly Cys 


Ala 


Gin 


Lys 


Asn 


Lys Pro 


Gly 


Val 


Tyr Thr 


Lys Val 




200 








205 






210 


Tyr Asn Tyr Val 




Trp 


He 


Lys Asn 


Thr 


He 


Ala Ala 


Asn Ser 






215 








220 






225 


<210> 6 




















<211> 231 




















<212> PRT 




















<213> Homo sapiens 
















<220> 




















<223> Chymo trypsin 
















<400> 6 




















Arg lie Val 


Asn 


Gly 


Glu 


Asp 


Ala Val 


Pro 


Gly 


Ser Trp 


Pro Trp 




5 








10 






15 


Gin Val Ser 


Leu 


Gin 


Asp 


Lys 


Thr Gly 


Phe 


His 


Phe Cys 


Gly Gly 






20 








25 






30 


Ser Leu lie 


Ser 


Glu 


Asp 


Trp 


Val Val 


Thr 


Ala 


Ala His 


Cys Gly 






35 








40 






45 


Val Arg Thr 


Ser 


Asp 


Val 


Val 


Val Ala 


Gly 


Glu 


Phe Asp Gin Gly 




50 








55 






60 


Ser Asp Glu 


Glu 


Asn 


He 


Gin 


Val Leu 


Lys 


He 


Ala Lys 


Val Phe 




65 








70 






75 


Lys Asn Pro 


Lys 


Phe 


Ser 


He 


Leu Thr 


Val 


Asn Asn Asp 


lie Thr 


80 








85 






90 


Leu Leu Lys 


Leu 


Ala 


Thr 


Pro 


Ala Arg 


Phe 


Ser 


Gin Thr 


Val Ser 




95 








100 






105 


Ala Val Cys 


Leu 


Pro 


Ser 


Ala 


Asp Asp 


Asp 


Phe 


Pro Ala Gly Thr 




110 








115 






120 


Leu Cys Ala 


Thr 


Thr 


Gly Trp Gly Lys 


Thr 


Lys Tyr Asn Ala Asn 




125 








130 






135 


Lys Thr Pro 


Asp 


Lys 


Leu 


Gin 


Gin Ala 


Ala 


Leu 


Pro Leu 


Leu Ser 




140 








145 






150 


Asn Ala Glu 


Cys 


Lys 


Lys 


Ser 


Trp Gly 


Arg 


Arg 


He Thr 


Asp Val 






155 








160 






165 


Met lie Cys 


Ala 


Gly 


Ala 


Ser Gly Val 


Ser 


Ser 


Cys Met 


Gly Asp 




170 








175 






180 


Ser Gly Gly 


Pro 


Leu 


Val 


Cys 


Gin Lys 


Asp 


Gly Ala Trp Thr Leu 




185 








190 






195 



SEQ-7 



Val Gly He Val Ser Trp Gly 

200 

Pro Gly Val Tyr Ala Arg Val 

215 

Lys He Leu Ala Ala Asn 

230 



<210> 7 

<211> 255 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Factor 7 



<400> 7 



Arg 


He 


Val 


Gly 


Gly 
5 


Lys Val 


Gin 


Val 


Leu 


Leu 


Leu 


Val Asn 










20 




Leu 


He 


Asn 


Thr 


He 


Trp Val 










35 




Lys 


He 


Lys 


Asn 


Trp 


Arg Asn 










50 




Asp 


Leu 


Ser 


Glu 


His 


Asp Gly 










65 




Gin 


Val 


He 


He 


Pro 


Ser Thr 










80 




Asp 


He 


Ala 


Leu 


Leu 


Arg Leu 










95 




His 


Val 


Val 


Pro 


Leu 


Cys Leu 










110 




Thr 


Leu 


Ala 


Phe 


Val 


Arg Phe 










125 




Leu 


Leu 


Asp 


Arg 


Gly 


Ala Thr 










140 




Val 


Pro 


Arg 


Leu 


Met 


Thr Gin 










155 




Val 


Gly 


Asp 


Ser 


Pro 


Asn He 










170 




Tyr 


Ser 


Asp 


Gly 


Ser 


Lys Asp 










185 




Pro 


His 


Ala 


Thr 


His 


Tyr Arg 










200 




Val 


Ser 


Trp 


Gly 


Gin 


Gly Cys 










215 




Tyr 


Thr 


Arg 


Val 


Ser 


Gin Tyr 










230 




Arg 


Ser 


Glu 


Pro 


Arg 


Pro Gly 










245 





Ser 


Asp 


Thr 


Cys Ser 


Thr 


Ser 


Ser 






205 








210 


Thr 


Lys 


Leu 


He Pro 


Trp 


Val 


Gin 






220 








225 


Cys 


Pro 


Lys 


Gly Glu 


Cys 


Pro 


Trp 






10 








15 


Gly Ala 


Gin 


Leu Cys 


Gly 


Gly 


Thr 






25 








30 


Val 


Ser 


Ala 


Ala His 


Cys 


Phe 


Asp 






40 








45 


Leu 


He 


Ala 


Val Leu Gly 


Glu 


His 






55 








60 


Asp 


Glu 


Gin 


Ser Arg 


Arg 


Val 


Ala 






70 








75 


Tyr 


Val 


Pro 


Gly Thr 


Thr 


Asn 


His 






85 








90 


His 


Gin 


Pro 


Val Val 


Leu 


ml 

Thr 


Asp 






100 








105 


Pro 


Glu 


Arg 


Thr Phe 


Ser 


Glu 


Arg 






115 








120 


Ser 


Leu 


Val 


Ser Gly Trp 


Gly 


Gin 






130 








135 


Ala 


Leu 


Glu 


Leu Met 


Val 


Leu 


Asn 






145 








150 


Asp 


Cys 


Leu 


Gin Gin 


Ser 


Arg 


Lys 






160 








165 


Thr 


Glu 


Tyr 


Met Phe 


Cys 


Ala 


Gly 






175 








180 


Ser 


Cys 


Lys 


Gly Asp 


Ser 


Gly 


Gly 






190 








195 


Gly Thr 


Trp 


Tyr Leu 


Thr 


Gly 


He 






205 








210 


Ala 


Thr 


Val 


Gly His 


Phe 


Gly 


Val 






220 








225 


He 


Glu 


Trp 


Leu Gin 


Lys 


Leu 


Met 






235 








240 


Val 


Leu 


Leu 


Arg Ala 


Pro 


Phe 


Pro 






250 








255 



SEQ-8 



<210> 8 

<211> 253 

<212> PRT 

<213> Homo sapiens 

<220> 



<223> Tissue plasminogen activator 
<400> 8 



Arg 


He 


Lys 


Gly Gly 


Leu 


Phe 


Ala 


Asp 


He 


Ala 


Ser 


His 


Pro 


Trp 








5 










10 










15 


Gin 


Ala 


Ala 


He Phe 


Ala 


Lys 


His 


Arg 


Arg 


Ser 


Pro 


Gly 


Glu 


Arg 








20 










25 










30 


Phe 


Leu 


Cys 


Gly Gly 


He 


Leu 


He 


Ser 


Ser 


Cys 


Trp 


He 


Leu 


Ser 








35 










40 










45 


Ala 


Ala 


His 


Cys Phe 


Gin 


Glu 


Arg 


Phe 


Pro 


Pro 


His 


His 


Leu 


Thr 








50 










55 










60 


Val 


He 


Leu 


Gly Arg 


Thr 


Tyr 


Arg 


Val 


Val 


Pro 


Gly 


Glu 


Glu 


Glu 








65 










70 










75 


Gin 


Lys 


Phe 


Glu Val 


Glu 


Lys 


Tyr 


He 


Val 


His 


Lys 


Glu 


Phe 


Asp 






80 










85 










90 


Asp 


Asp 


Thr 


Tvr As"D 


Asn 


Asp 


He 


Ala 


Leu 


Leu 


Gin 


Leu 


Lvs 


Ser 






95 










100 










105 


Asp 


Ser 


Ser 


Arg Cys 


Ala 


Gin 


Glu 


Ser 


Ser 


Val 


Val 


Arg 


Thr 


Val 








110 










115 










120 


Cys 


Leu 


Pro 


Pro Ala 


Asp 


Leu 


Gin 


Leu 


Pro 


Asp 


Trp 


Thr 


Glu 


Cys 






125 










130 










135 


Glu 


Leu 


Ser 


Gly Tyr 


Gly 


Lys 


His 


Glu 


Ala 


Leu 


Ser 


Pro 


Phe 


Tyr 








140 










145 










150 


Ser 


Glu 


Arg 


Leu Lys 


Glu 


Ala 


His 


Val 


Arg 


Leu 


Tyr 


Pro 


Ser 


Ser 








155 










160 










165 


Arg 


Cys 


Thr 


Ser Gin 


His 


Leu 


Leu 


Asn 


Arg 


Thr 


Val 


Thr 


Asp 


Asn 




170 










175 










180 


Met 


Leu 


Cys 


Ala Gly 


Asp 


Thr 


Arg 


Ser 


Gly 


Gly 


Pro 


Gin 


Ala 


Asn 








185 










190 










195 


Leu 


His 


Asp 


Ala Cys 


Gin 


Gly 


Asp 


Ser 


Gly 


Gly 


Pro 


Leu 


Val 


Cys 








200 










205 










210 


Leu 


Asn 


Asp 


Gly Arg 


Met 


Thr 


Leu 


Val 


Gly 


He 


He 


Ser 


Trp 


Gly 








215 










220 










225 


Leu 


Gly 


Cys 


Gly Gin 


L y s 


Asp 


Val 


Pro 


Gly 


Val 


Tyr 


Thr 


Lys 


Val 








230 










235 










240 


Thr 


Asn 


Tyr 


Leu Asp 


Trp 


He 


Arg 


Asp 


Asn 


Met 


Arg 


Pro 







245 250 

<210> 9 

<211> 2900 

<212> DNA 

<213> Homo sapiens 

<220> 

SEQ-9 



<223> SNC-19; GeneBank Accession No. #1720428 



<400> 9 

cgctgggtgg tgctggcagc cgtgctgatc 
ttcctggtgt ggcatttgca gtaccgggac 
atgaggatca caaatgagaa ttttgtggat 
gtaagcctgg ccagcaaggt gaaggacgcg 
ctgggcccct accacaagga gtcggctgtg 
tactactggt ctgagttcag catcccgcag 
gccaggagcg cgtagtcatg ctgcccccgc 
cctcagtggt ggctttcccc acggactcca 
gcagctttgg cctgcacgcc gcggtgtgga 
gacagcccct accccgctca tgcccgctgc 
gagctactcg agctgactcg cagcttgact 
gacgtgtaca acaccctgag ccccatggag 
cctccctcct acaacctgac cttccactcc 
taaccaacac tgacgcggca tcccggcttt 
agcagctgtg gaggccgctt acgtaaagcc 
ggccactacc cacccaacat tgactgcaca 
gtgaaggtgc gcttcaaatt cttctacctg 
cccaaggact acgtggagat caatggggag 
gtcaccagca acagcaacaa gatcacagtt 
accggcttct tagctgaata cctctcctac 
acgtgccgca cggggcggtg tatccggaag 
ccgaccacag cgatgagctc aactgcagtt 
gcaagttctg caagctcttc tgggtctgcg 
acgagcaggg ttgcatttgt ccggacccag 
tcgaaaagcc agcagtgcaa tgggaaggac 
tgccccaagg tgaacgtcgt cacttgtacc 
tgcttgagca agggcaaccc tgagtgtgac 
gagaaggact gcgactgtgg gctgcggtca 
acggatgcgg atgagggcga gtggccctgg 
cacatctgcg gtgcttccct catctctccc 
atcgatgaca gaggattcag gtactcagac 
cgaccagagc cagcgcaggc cctggggtgc 
accccttctt caatgacttc accttcgact 
cggcagagta cagctccatg gtgcggccca 
ctgccggcaa ggccatctgg gtcacgggct 
cgctgatcct gcaaaagggt gagatccgcg 
tgccgcagca gatcacgccg cgcatgatgt 
cctgccaggg tgattccggg ggacccctgt 
aggccggtgt ggtgagctgg ggagacgctg 
aaggctccct ctgtttcggg aatggatcaa 
acccaaatgt gtacacctgc ggggccaccc 
ctggagactc gcgcaccgtg acctgcacca 
tccaggctca aatctgctag aaaacctctc 
gggtagaagg ggaggaacac tggtggttct 
cagctccggc agcccaagtg ggcgaggacg 
ggaagacctg gatctctagt gagtgtgact 
ttcttgagga agcccaggct cggaggaccc 
ggtttaccag ctcccaggtg acttcagtgt 
tttcttttta aaaaaaaaaa 



ggcctcctct tggtcttgct ggggatcggc 60 
gtgcgtgtcc agaaggtctt caatggctac 120 
gcctacgaga actccaactc cactgagttt 180 
ctgaagctgc tgtacagcgg agtcccattc 240 
acggccttca gcgagggcag cgtcatcgcc 300 
cacctggttg aggaggccga gcgcgtcatg 3 60 
gggcgcgctc cctgaagtcc tttgtggtca 420 
aaacagtaca gaggacccag gacaacagct 480 
gctgatgcgc ttcaccacgc cggcttccct 540 
cagtgggctg cggggacgcg acgcagtgct 600 
gcgcctcgac gagcgcggca gcgacctggt 660 
ccccacgcct ggtgagtgtg tggcacctac 720 
ctcccacgaa cgtcctgctc atcacactga 780 
gaggccacct tcttccagct gcctaggatg 840 
caggggacat tcaacagccc ctactaccca 900 
tggaaaattg aggtgcccaa caaccagcat 960 
ctggagcccg gcgtgcctgc gggcacctgc 1020 
aaatactgcg gagagaggtc ccagttcgtc 1080 
cgcttccact cagatcagtc ctacaccgac 1140 
gactccagtg acccatgccc ggggcagttc 1200 
gagctgcgct gtgatggctg ggcgactgca 1260 
gcgacgccgg ccaccagttc acgtgcaaga 1320 
acagtgtgaa cgagtgcgga gacaacagcg 1380 
accttcaggt gttccaatgg gaagtgcctc 1440 
gactgtgggg acgggtccga cgaggcctcc 1500 
aaacacacct accgctgcct caatgggctc 1560 
gggaaggagg actgtagcga cggctcagat 1620 
ttcacgagac aggctcgtgt tgttgggggc 1680 
caggtaagcc tgcatgctct gggccagggc 1740 
aactggctgg tctctgccgc acactgctac 1800 
cccacgcagg acggccttcc tgggcttgca 1860 
aggagcgcag gctcaagcgc atcatctccc 1920 
atgacatcgc gctgctggag ctggagaaac 1980 
tctgcctgcc ggacgcctgc catgtcttcc 2040 
ggggacacac ccagtatgga ggcactggcg 2100 
tcatcaacca gaccacctgc gagaacctcc 2160 
gcgtgggctt cctcagcggc ggcgtggact 2220 
ccagcgtgga ggcggatggg cggatcttcc 2280 
cgctcagagg aacaagccag gcgtgtacac 2340 
agagaacact ggggtatagg ggccggggcc 2400 
atcgtccacc ccagtgtgca cgcctgcagg 2460 
gcgccccaga acatacactg tgaactcatc 2520 
gcttcctcag cctccaaagt ggagctggga 2580 
actgacccaa ctggggcaag gtttgaagca 2640 
cgtttgtgca tactgccctg ctctatacac 2700 
gccggatctg gctgtggtcc ttggccacgc 2760 
tggaaaacag acgggtctga gactgaaaat 2820 
gtgtattgtg taaatgagta aaacatttta 2880 

2900 



SEQ-10 



<210> 10 



<211> 922 

<212> PRT 

<213> Mus musculus 

<220> 



<223> Epithin 
<400> 10 



Met 


Gly 


Ser 


Asn 


Arg 


Glv 


Arcr 


Lys 


Ala 


Glv 


Gly 


Glv 


Ser 


Gin 


Asp 










5 










10 










15 


Phe 


Gly 


Ala 


Gly 


Leu 


Lys 


Tvr 


Asp 


Ser 


Arg 


Leu 


Glu 


Asn 


Met 


Asn 










20 










25 










30 




Phe> 

-L J. ±v3- 




Glu 


Glv 


Val 


Glu 


Phe 


Leu 


Pro 


Ala 


Asn 


Asn 


Ala 


Lys 








~J -J 










40 










45 




\Ta 1 

V CLX 


VJ JL LX 


J— i_y o 


Arrr 
.MX y 


Gly 


Pro 


•"•x y 


Arcr 

rix. y 




Val 


Val 


Leu 


Val 


Ala 










so 










55 










60 


V ClJL. 


UC LX 


XT XJLC. 




rue 


XJ <C- LX 


Leu 


Leu 




Leu 


Met 


Ala 


Gly 


Leu 


Leu 










\_> *j 










70 










75 


V CL X 






Phe 


His 




Arg 


Asn 


Val 


Arg 


Val 


Gin 


Lys 


Val 


Phe 










80 










85 










90 


Asn 


Glv 


His 


Leu 


Arg 


Ile 


Thr 


Asn 


Glu 


lie 


Phe 


Leu 


Asp 


Ala 


Tvr 










95 










100 










105 


Glu 


Asn 


Ser 


Thr 


Ser 


Thr 


Glu 


Phe 


lie 


Ser 


Leu 


Ala 


Ser 


Gin 


Val 










110 










115 










120 


T A/Q 


Glu 


Ala 


Leu 


LyS 


Leu 


Leu 


Tvr 


Asn 


Glu 


Val 


Pro 


Val 


Leu 


Glv 








125 










130 










135 


JT i_ O 




±xx o 


x_i_y o 


T A/Q 

J: 




Ala 


Val 


Thr 


Ala 


Phe 


Ser 


Glu 


Gly 


Ser 










1 40 










145 










150 


Va 1 

V CL _1_ 


He 


Ala 


Tvr 

x_y x 


«y 


x x^y 


Ser 


Glu 


Phe 


Ser 


lie 


Pro 


Pro 


His 


Leu 










j > ~) 










160 










165 


Al ;=» 


ril ii 

VjJL LI 


VJJ. Li 


V CLX 




rixy 


Ala 


Met 


Ala 


Val 


Glu 


Arg 


Val 


Val 


Thr 










17 0 
x / u 










175 

X / ~J 










180 


Leu 


Jr x u 


X X \J 


•"■X y 


Al a 

XH.X CL 


rix y 


Ala 






*J ^X. 


Phe 


Val 


Leu 


Thr 


Ser 










lOJ 










190 










195 


Val 


Val 


Ala 


Phe 


Pro 


lie 


Asp 


Pro 


Arg 


Met 


Leu 


Gin 


Arg 


Thr 


Gin 










200 










205 










210 


Asp 


Asn 


Ser 


Cys 


Ser 


Phe 


Ala 


Leu 


His 


Ala 


His 


Gly 


Ala 


Ala 


Val 








215 










220 










225 


Thr 


Arg 


Phe 


Thr 


Thr 


Pro 


Gly 


Phe 


Pro 


Asn 


Ser 


Pro 


Tyr 


Pro 


Ala 










230 










235 










240 


His 


Ala 


Arg 


Cys 


Gin 


Trp 


Val 


Leu 


Arg 


Gly 


Asp 


Ala 


Asp 


Ser 


Val 










245 










250 










255 


Leu 


Ser 


Leu 


Thr 


Phe 


Arg 


Ser 


Phe 


Asp 


Val 


Ala 


Pro 


Cys 


Asp 


Glu 










260 










265 










270 


His 


Gly 


Ser 


Asp 


Leu 


Val 


Thr 


Val 


Tyr 


Asp 


Ser 


Leu 


Ser 


Pro 


Met 








275 










280 










285 


Glu 


Pro 


His 


Ala 


Val 


Val 


Arg 


Leu 


Cys 


Gly 


Thr 


Phe 


Ser 


Pro 


Ser 










290 










295 










300 


Tyr 


Asn 


Leu 


Thr 


Phe 


Leu 


Ser 


Ser 


Gin 


Asn 


Val 


Phe 


Leu 


Val 


Thr 








305 










310 










315 


Leu 


He 


Thr 


Asn 


Thr 


Gly 


Arg 


Arg 


His 


Leu 


Gly 


Phe 


Glu 


Ala 


Thr 



SEQ-ll 



Phe 


Phe 


Gin 


Asp 


Thr 


Gin 


Pro 


Pro 


Asn 


Arg 


Asn 


Val 


Asn 


Val 


Pro 


Gly 


Glu 


Lys 


Asn 


Ser 


Ser 


Thr 


Asp 


Thr 


Asp 


Pro 


Cys 


Arg 


Lys 


Glu 


Ser 


Asp 


Glu 


Cys 


Lys 


Asn 


Val 


Asn 


Asp 


Pro 


Ala 


Gly 


Ser 


Gin 


Lys 


Glu 


Ala 


Ser 


Thr 


Tyr 


Arg 


Glu 


Cys 


Asp 


Asn 


Cys 


Asp 


Val 


Gly 


Gly 


Ser 


Leu 


His 


He 


Ser 


Pro 


Asp 


Lvs 


Asn 


Leu 


Gly 


Leu 


Glu 


Leu 


Lys 


Phe 


Thr 


Phe 


Val 


Glu 


Tyr 





320 




Leu 


Pro 


Lys 




335 




Gly 


Thr 


Phe 




350 




He 


Asn 


Cys 




365 




Lys 


Val 


Arg 




380 




Val 


Gly 


Ser 




395 




Gly 


Ser 


Gly 




410 




Lys 


He 


Thr 




425 




Gly 


Phe 


Leu 




A A f\ 

440 




Pro 


Gly 


Met 




455 




Leu 


Arg 


Cys 




470 




Arg 


Tyr 


Cys 




485 




Gin 


Phe 


Cys 




500 




Cys 


Gly 


Asp 




515 




Ser 


Phe 


Lys 




530 




Cys 


Asn 


Gly 




545 




Cys 


Asp 


Ser 




560 




Cys 


Gin 


Asn 




575 




Gly 


Lys 


Thr 




590 




Cys 


Gly 


Leu 




605 




Thr 


Asn 


Ala 




620 




Ala 


Leu 


Gly 




635 




Asp 


Trp 


Leu 




650 




Phe 


Lys 


Tyr 




665 




Leu 


Asp 


Gin 




680 




Leu 


Lys 


Arg 




695 




Asp 


Tyr 


Asp 




710 




Ser 


Thr 


Val 



Met 


Ser 


Ser 


Ser 


Ser 


Pro 


Thr 


Trp 


Asn 


Phe 


Lvs 


Leu 


Cys 


Thr 


Lys 


Glu 


Arq 


Ser 


Val 


His 


Phe 


Ala 


Glu 


Tyr 


Phe 


Met 


Cys 


Asp 


Gly 


Trp 


Arq 


Cys 


Asn 


Lys 


Pro 


Leu 


Glv 


Ser 


Asp 


Cys 


Ser 


Asn 


Lys 

J: 


Asp 


Asn 


Val 


Asn 


Val 


Gly 


Leu 


Cys 


Asp 


Cys 


Ser 


Arg 


Ser 


Phe 


Asp 


Glu 


Gly 


Gin 


Gly 

_z 


His 


Val 


Ser 


Ala 


Ser 


Asp 


Tvr 


Ser 


Lys 


Arg 


He 


He 


Thr 


He 


Ala 


Leu 


Val 


Arg 
SEQ- 


Pro 
-12 



325 






Cys 


Gly 


Gly 


340 






Tyr 


Tyr 


Pro 


--i i— i - 
355 






He 


Lys 


Val 


370 






Phe 


Tyr 


Leu 


385 






Asp 


Tyr 


Val 


400 






Gin 


Phe 


Val 


415 






His 


Ser 


Asp 


430 






Leu 


Ser 


Tyr 


445 






Lys 


Thr 


Gly 


460 






Ala 


Asp 


Cys 


475 






Ala 


Thr 


His 


490 






Phe 


Trp 


Val 


505 






Glu 


Glu 


Gly 


520 






Gly 


Lys 


Cys 


535 






Cys 


Gly 


Asp 


550 






Val 


Ser 


Cys 


565 






Leu 


Ser 


Lys 


580 






Asp 


Gly 


Ser 


595 






Thr 


Lys 


Gin 


610 






Glu 


Trp 


Pro 


625 






Leu 


Cys 


Gly 


640 






Ala 


His 


Cys 


655 






Thr 


Met 


Trp 


670 






Ser 


Ala 


Ser 


685 






His 


Pro 


Ser 


700 






Leu 


Glu 


Leu 


715 






He 


Cys 


Leu 







330 


Val 


Leu 


Ser 






345 


Gly 


His 


Tyr 






360 


Pro 


Asn 


Asn 






375 


Val 


Asp 


Pro 






390 


Glu 


He 


Asn 






405 


Val 


Ser 


Ser 






420 


His 


Ser 


Tyr 






435 


Asp 


Ser 


Asn 






450 


Arg 


Cys 


He 






465 


Pro 


Asp 


Tyr 






480 


Gin 


Phe 


Thr 






495 


Cys 


Asp 


Ser 






510 


Cys 


Ser 


Cys 






525 


Leu 


Pro 


Gin 






540 


Gly 


Ser 


Asp 






555 


Thr 


Lys 


Tyr 






570 


Gly 


Asn 


Pro 






585 


Asp 


Glu 


Lys 






600 


Ala 


Arg 


Val 






615 


Trp 


Gin 


Val 






630 


Ala 


Ser 


Leu 






645 


Phe 


Gin 


Asp 






660 


Thr 


Ala 


Phe 






675 


Gly 


Val 


Gin 






f C\ f\ 

690 


Phe 


Asn 


Asp 






705 


Glu 


Lys 


Ser 






720 


Pro 


Asp 


Ala 











725 






730 




735 


Thr 


His 


Val 


Phe 


Pro 


Ala 


Gly Lys Ala 


He 


Trp Val Thr 


Gly Trp 










740 






745 




750 


Gly 


His 


Thr 


Lys 


Glu 


Gly 


Gly Thr Gly Ala 


Leu He Leu 


Gin Lys 










755 






760 




*~I C ET 

765 


Gly 


Glu 


lie 


Arg Val 


He 


Asn Gin Thr 


Thr 


Cys Glu Asp 


Leu Met 










770 






775 




780 


Pro 


Gin 


Gin 


He 


Thr 


Pro 


Arg Met Met 


Cys 


Val Gly Phe 


Leu Ser 










785 






790 




795 


Gly 


Gly 


Val 


Asp 


Ser 


Cys 


Gin Gly Asp 


Ser 


Gly Gly Pro 


Leu Ser 










800 






805 




810 


Ser 


Ala 


Glu 


Lys 


Asp 


Gly 


Arg Met Phe 


Gin 


Ala Gly Val 


Val Ser 










815 






820 




825 


Trp 


Gly 


Glu 


Gly Cys 


Ala 


Gin Arg Asn 


Lys 


Pro Gly Val 


Tyr Thr 










830 






835 




840 


Arg 


Leu 


Pro 


Cys 


Ser 


Ser 


Gly Leu Asp Gin 


Arg Ala His 


Trp Gly 










845 






850 




o tr c" 

855 


lie 


Ala 


Ala 


Trp 


Thr 


Asp 


Ser Arg Pro 


Gin 


Thr Pro Thr 


Gly Met 










860 






865 




870 


Pro 


Asp 


Met 


His 


Thr 


Trp 


He Gin Glu 


Arg 


Asn Thr Asp 


Asp He 










875 






900 




905 


Tyr 


Ala 


Val 


Ala 


Ser 


Pro 


Pro Gin His 


Asn 


Pro Asp Cys 


Glu Leu 








910 






915 




920 


His 


Pro 



















<210> 11 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<221> n=Inosine 

<222> 6, 9, 12, 15, 18 

<223> Degenerate oligonucleotide primer 

<400> 11 

tgggtngtna cngcngcnca ytg 23 



<210> 


12 


<211> 


20 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<221> 


n=Inosine 


<222> 


3, 6, 9, 12, 18 



SEQ-13 



<223> Degenerate oligonucleotide primer 
<400> 12 

arnggnccnc cnswrtcncc 

<210> 13 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Fragment of TADG-15 
<400> 13 

Leu Phe Arg Asp Trp lie Lys Glu Asn Thr Gly Val 

5 10 

<210> 14 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> TADG-15 forward oligonucleotide primer 
<400> 14 

atgacagagg attcaggtac 

<210> 15 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> TADG-15 reverse oligonucleotide primer 

<400> 15 

gaaggtgaag tcattgaaga 

<210> 16 
<211> 20 
<212> DNA 

<213> Artificial sequence 

SEQ-14 



<220> 

<223> p-tubulin forward oligonucleotide primer 
<400> 16 

cgcatcaacg tgtactacaa 20 

<210> 17 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> p-tubulin reverse oligonucleotide primer 

<400> 17 

tacgagctgg tggactgaga 20 



<210> 18 
<211> 3147 
<212> RNA 

<213> Artificial sequence 
<220> 

<223> Antisense of TADG-15 
<400> 18 

uuuuuuuuuu uuuuuuuuua aaaagaaaua 
caaauacaca cacugaaguc cacccuggga 
cucagacccg ucuguuuucc aggguccucc 
gcguggccca agggccccac agcccagauc 
cugaggaggc uccgaagcuc cguucccgcu 
gcagauauac acaaacgcgc cucggcccag 
cugugucuuc aaaccuuugc ccccaguugg 
ccuccccuuc uaccucccag cuccacuuug 
uuuucuaggc agauuuggag cccuggagau 
ugggggcgcu ggugcaguca gcgguccagu 
cacuggggug gacgaugggu ggccccgcag 
ccggccccua uaccccagug uucucuuuga 
agccuugugu acacgccugg cuuguuccuc 
gcucaccaca ccggccugga agauccgccc 
gggguccccc ggaaucaccc uggcaggagu 
cccacgcaca ucaugcgcgg cgugaucugc 
gguggucugg uugaugacgc ggaucucacc 
cagugccucc auacugggug uguccccagc 
ccggcaggga agacauggga ggcguccggc 
ggagcuguac ucugccgguu ucuccagcuc 
cgaaggugaa gucauugaag aagggguggg 



aauuguuuua cccauuuaca 50 

gcugguaaaa caauuucagu 100 

gagccugggc uuccucaaga 150 

cggcagcccc accaccuuca 200 

gcuccuuaca gacaggggag 250 

cuuggggcug gcgggggagg 300 

gucaguagaa ccaccagugu 350 

gaggcugagg aagcgagagg 400 

ugaguucaca guguauguuc 450 

cuccagccug caggcgugca 500 

guguacacau uuggguggcc 550 

uccagucccg aaacagaggg 600 

ugagcgcagc cgucucccca 650 

auccgccucc acgcuggaca 700 

ccacgccgcc gcugaggaag 750 

ugcggcagga gguucucgca 800 

cuuuugcagg aucagcgcgc 850 

ccgugaccca gauggccuug 900 

aggcagaugg gccgcaccau 950 

cagcagcgcg augucauagu 1000 

agaugaugcg cuugagccug 1050 



SEQ-15 



cgcuccugca ccccaggggc gcugcgcugg cucuggucgu gcaagcccag 1100 

gaaggccguc cacugcgugg ggucugagua ccugaauccu cugucaucga 1150 

uguagcagug ugcggcagag accagccagu ugggagagau gagggaagca 1200 

ccgcagaugu ggcccuggcc cagagcaugc aggcuuaccu gccagggcca 1250 

cucgcccuca uccgcauccg ugcccccaac aacacgagcc ugucucguga 13 00 

augaccgcag cccacagucg caguccuucu caucugagcc gucgcuacag 1350 

uccuccuucc cgucacacuc aggguugccc uugcucaagc agagcccauu 1400 

gaggcagcgg uagguguguu ugguacaagu gacgacguuc accuuggggc 1450 

aggaggccuc gucggacccg uccccacagu cguccuuccc auugcacugc 1500 

uggcuuuucg agaggcacuu cccauuggaa caccugaagg ucugggccgg 1550 

acaacugcac cccugcucgu cgcuguuguc uccgcagucg uucacacugu 1600 

cgcagaccca gaagaggggc uugcagaacu uguucuugca cgugaacugg 1650 

uggccggcgu cgcaacugca guugagcuca ucgcuguggu cggugcaguc 1700 

ggcccagcca ucacagcgca gcuccuuccg gauacaccgc cccgugcggc 1750 

acgugaacug ccccgggcau gggucacugg agucguagga gagguauuca 1800 

gcuaagaagc cggugucggu guaggacuga ucugagugga agcgaacugu 1850 

gaucuuguug cuguugcugg ugacgacgaa cugggaccuc ucuccgcagu 1900 

auuucucccc auugaucucc acguaguccu uggggcaggu gcccgcaggc 1950 

acgccgggcu ccagcaggua gaagaauuug aagcucaccu ucacaugcug 2 000 

guuguugggc accucaaugu uccaugugca gucaauguug gguggguagu 2 050 

ggccugggua guaggggcug uugaaugucc ccugggcuuu acguaagcgg 2100 

ccuccacagc ugcucauccu aggcagcugg aagaaggugg ccucaaagcc 2150 

gggaugccgc cgcucagugu ugguuaucag ugugaugagc aggacguucu 2200 

gggaggagug gaaggucagg uuguaggagg gaggguaggu gccacacaac 2250 

ugcaccaggg cguggggcuc cauggggcuc aggguguugu acaccgucac 23 00 

caggucgcug ccgcgcucgu cgcaggacgc aaggucaaag cugcggaagg 2350 

ugaggcucag cacugagucg gcgucccccc gcagggccca cuggcagcgg 2400 

gcaugagcgg gguaggggcu gucagggaag ccgggcgugg ugaagcgcau 2450 

cagcuccaca ccgcgggcgu gcaggccaaa gcugcagcug uuguccuggg 2500 

uccucuguac uguuuuggag uccgugggga aagccaccac ugaggugacc 2550 

acaaaggacu ucagggagcg cgcccgcggg ggcagcauga cuacgcgcuc 2600 

cucggccaug acgcgcucgg ccuccuccac caggugcugc gggaugcuga 2650 

acucagacca guaguaggcg augacgcugc ccucgcugaa ggccgucaca 2700 

gccgacuccu ugugguaggg gcccaggaau gggacuccgc uguacagcag 2750 

cuucagcgcg uccuucaccu ugcuggccag gcuuacaaac ucaguggagu 2800 

uggaguucuc guaggcaucc acaaaauucu cauuugugau ccucauguag 2850 

ccauugaaga ccuucuggac acgcacgucc cgguacugca aaugccacac 2900 

caggaagccg auccccagca agaccaagag gaggccgauc agcacggcug 2950 

ccagcaccac ccagcgcccc gggccaugcu uuuccaccuu cuugacguug 3 000 

uugacuggca ggaacuccac gccuuccucc aagccauuca cuuucucgug 3050 

ccgggaguug uacuugaguc ccgcgccgaa guccuucggg cccccuccgc 3100 

ccuugcgggc ccgaucgcuc cccaugguac cccgaggccg cucuuga 3147 

<210> 19 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 68-76 of the TADG-15 protein 
<400> 19 



SEQ-16 



Val Leu Leu Gly lie Gly Phe Leu Val 

5 

<210> 20 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 126-134 of the TADG-15 protein 
<400> 20 

Leu Leu Tyr Ser Gly Val Pro Phe Leu 

5 

<210> 21 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 644-652 of the TADG-15 protein 
<400> 21 

Ser Leu lie Ser Pro Asn Trp Leu Val 

5 

<210> 22 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 379-387 of the TADG-15 protein 
<400> 22 

Lys Val Ser Phe Lys Phe Phe Tyr Leu 

5 

<210> 23 
<211> 9 
<212> PRT 

SEQ-17 



<213> Homo sapiens 
<220> 

<223> Residues 386-394 of the TADG-15 protein 
<400> 23 

Tyr Leu Leu Glu Pro Gly Val Pro Ala 

5 

<210> 24 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 257-265 of the TADG-15 protein 
<400> 24 

Ser Leu Thr Phe Arg Ser Phe Asp Leu 

5 

<210> 25 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 762-770 of the TADG-15 protein 
<400> 25 

lie Leu Gin Lys Gly Glu lie Arg Val 

5 

<210> 26 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 841-849 of the TADG-15 protein 
<400> 26 

Arg Leu Pro Leu Phe Arg Asp Trp lie 

5 

SEQ-18 



<210> 27 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 64-72 of the TADG-15 protein 
<400> 27 

Gly Leu Leu Leu Val Leu Leu Gly lie 

5 

<210> 28 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 57-65 of the TADG-15 protein 
<400> 28 

Val Leu Ala Ala Val Leu lie Gly Leu 

5 

<210> 29 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 67-75 of the TADG-15 protein 
<400> 29 

Leu Val Leu Leu Gly lie Gly Phe Leu 

5 

<210> 30 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 



SEQ-19 



<223> Residues 379-387 of the TADG-15 protein 
<400> 30 

Lys Val Ser Phe Lys Phe Phe Tyr Leu 

5 

<210> 31 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 126-134 of the TADG-15 protein 
<400> 31 

Leu Leu Tyr Ser Gly Val Pro Phe Leu 

5 

<210> 32 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 88-96 of the TADG-15 protein 
<400> 32 

Lys Val Phe Asn Gly Tyr Met Arg lie 

5 

<210> 33 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 670-678 of the TADG-15 protein 
<400> 33 

Thr Gin Trp Thr Ala Phe Leu Gly Leu 

5 



<210> 34 



SEQ-20 



<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 119-127 of the TADG-15 protein 
<400> 34 

Lys Val Lys Asp Ala Leu Lys Leu Leu 

5 

<210> 35 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 60-68 of the TADG-15 protein 
<400> 35 

Ala Val Leu lie Gly Leu Leu Leu Val 

5 

<210> 36 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 62-70 of the TADG-15 protein 
<400> 36 

Leu lie Gly Leu Leu Leu Val Leu Leu 

5 

<210> 37 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 57-65 of the TADG-15 protein 
<400> 37 

SEQ-21 



Val Leu Ala Ala Val Leu lie Gly Leu 

5 

<210> 38 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 61-69 of the TADG-15 protein 
<400> 38 

Val Leu lie Gly Leu Leu Leu Val Leu 

5 

<210> 39 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 146-154 of the TADG-15 protein 
<400> 39 

Phe Ser Glu Gly Ser Val lie Ala Tyr 

5 

<210> 40 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 658-666 of the TADG-15 protein 
<400> 40 

Tyr lie Asp Asp Arg Gly Phe Arg Tyr 

5 

<210> 41 
<211> 9 
<212> PRT 

SEQ-22 



<213> Homo sapiens 
<220> 

<223> Residues 449-457 of the TADG-15 protein 
<400> 41 

Ser Ser Asp Pro Cys Pro Gly Gin Phe 

5 

<210> 42 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 401-409 of the TADG-15 protein 
<400> 42 

Tyr Val Glu lie Asn Gly Glu Lys Tyr 

5 

<210> 43 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 387-395 of the TADG-15 protein 
<400> 43 

Leu Leu Glu Pro Gly Val Pro Ala Gly 

5 

<210> 44 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 553-561 of the TADG-15 protein 
<400> 44 

Gly Ser Asp Glu Ala Ser Cys Pro Lys 

5 

SEQ-23 



<210> 45 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 97-105 of the TADG-15 protein 
<400> 45 

Thr Asn Glu Asn Phe Val Asp Ala Tyr 

5 

<210> 46 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 110-118 of the TADG-15 protein 
<400> 46 

Ser Thr Glu Phe Val Ser Leu Ala Ser 

5 

<210> 47 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 811-819 of the TADG-15 protein 
<400> 47 

Ser Val Glu Ala Asp Gly Arg lie Phe 

5 

<210> 48 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 



SEQ-24 



<223> Residues 666-674 of the TADG-15 protein 
<400> 48 

Tyr Ser Asp Pro Thr Gin Trp Thr Ala 

5 

<210> 49 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 709-717 of the TADG-15 protein 
<400> 49 

Asp Tyr Asp lie Ala Leu Leu Glu Leu 

5 

<210> 50 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 408-416 of the TADG-15 protein 
<400> 50 

Lys Tyr Cys Gly Glu Arg Ser Gin Phe 

5 

<210> 51 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 754-762 of the TADG-15 protein 
<400> 51 

Gin Tyr Gly Gly Thr Gly Ala Leu lie 

5 



<210> 52 



SEQ-25 



<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 153-161 of the TADG-15 protein 
<400> 52 

Ala Tyr Tyr Trp Ser Glu Phe Ser lie 

5 

<210> 53 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 722-730 of the TADG-15 protein 
<400> 53 

Glu Tyr Ser Ser Met Val Arg Pro lie 

5 

<210> 54 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 326-334 of the TADG-15 protein 
<400> 54 

Gly Phe Glu Ala Thr Phe Phe Gin Leu 

5 

<210> 55 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 304-312 of the TADG-15 protein 
<400> 55 

SEQ-26 



Thr Phe His Ser Ser Gin Asn Val Leu 

5 

<210> 56 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 707-715 of the TADG-15 protein 
<400> 56 

Thr Phe Asp Tyr Asp lie Ala Leu Leu 

5 

<210> 57 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 21-29 of the TADG-15 protein 
<400> 57 

Lys Tyr Asn Ser Arg His Glu Lys Val 

5 

<210> 58 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 665-673 of the TADG-15 protein 
<400> 58 

Arg Tyr Ser Asp Pro Thr Gin Trp Thr 

5 

<210> 59 
<211> 9 
<212> PRT 

SEQ-27 



<213> Homo sapiens 
<220> 

<223> Residues 686-694 of the TADG-15 protein 
<400> 59 

Ala Pro Gly Val Gin Glu Arg Arg Leu 

5 

<210> 60 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 12-20 of the TADG-15 protein 
<400> 60 

Gly Pro Lys Asp Phe Gly Ala Gly Leu 

5 

<210> 61 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 668-676 of the TADG-15 protein 
<400> 61 

Asp Pro Thr Gin Trp Thr Ala Phe Leu 

5 

<210> 62 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 461-469 of the TADG-15 protein 
<400> 62 

Thr Gly Arg Cys lie Arg Lys Glu Leu 

5 

SEQ-28 



<210> 63 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 59-67 of the TADG-15 protein 
<400> 63 

Ala Ala Val Leu lie Gly Leu Leu Leu 

5 

<210> 64 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 379-387 of the TADG-15 protein 
<400> 64 

Lys Val Ser Phe Lys Phe Phe Tyr Leu 

5 

<210> 65 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 119-127 of the TADG-15 protein 
<400> 65 

Lys Val Lys Asp Ala Leu Lys Leu Leu 

5 

<210> 66 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 



SEQ-29 



<223> Residues 780-788 of the TADG-15 protein 
<400> 66 

Leu Pro Gin Gin lie Thr Pro Arg Met 

5 

<210> 67 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 67-75 of the TADG-15 protein 
<400> 67 

Leu Val Leu Leu Gly lie Gly Phe Leu 

5 

<210> 68 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 283-291 of the TADG-15 protein 
<400> 68 

Ser Pro Met Glu Pro His Ala Leu Val 

5 

<210> 69 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 12-20 of the TADG-15 protein 
<400> 69 

Gly Pro Lys Asp Phe Gly Ala Gly Leu 

5 



<210> 70 



SEQ-30 



<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 257-265 of the TADG-15 protein 
<400> 70 

Ser Leu Thr Phe Arg Ser Phe Asp Leu 

5 

<210> 71 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 180-188 of the TADG-15 protein 
<400> 71 

Met Leu Pro Pro Arg Ala Arg Ser Leu 

5 

<210> 72 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 217-225 of the TADG-15 protein 
<400> 72 

Gly Leu His Ala Arg Gly Val Glu Leu 

5 

<210> 73 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 173-181 of the TADG-15 protein 
<400> 73 

SEQ-31 



Met Ala Glu Glu Arg Val Val Met Leu 

5 

<210> 74 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 267-275 of the TADG-15 protein 
<400> 74 

Ser Cys Asp Glu Arg Gly Ser Asp Leu 

5 

<210> 75 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 567-575 of the TADG-15 protein 
<400> 75 

Cys Thr Lys His Thr Tyr Arg Cys Leu 

5 

<210> 76 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 724-732 of the TADG-15 protein 
<400> 76 

Ser Ser Met Val Arg Pro lie Cys Leu 

5 

<210> 77 
<211> 9 
<212> PRT 

SEQ-32 



<213> Homo sapiens 
<220> 

<223> Residues 409-417 of the TADG-15 protein 
<400> 77 

Tyr Gys Gly Glu Arg Ser Gin Phe Val 

5 

<210> 78 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Residues 495-503 of the TADG-15 protein 
<400> 78 

Thr Cys Lys Asn Lys Phe Cys Lys Pro 

5 

<210> 79 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 427-435 of the TADG-15 protein 
<400> 79 

Val Arg Phe His Ser Asp Gin Ser Tyr 

5 

<210> 80 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 695-703 of the TADG-15 protein 
<400> 80 

Lys Arg lie lie Ser His Pro Phe Phe 

5 

SEQ-33 



<210> 81 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 664-672 of the TADG-15 protein 
<400> 81 

Phe Arg Tyr Ser Asp Pro Thr Gin Trp 

5 

<210> 82 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 220-228 of the TADG-15 protein 
<400> 82 

Ala Arg Gly Val Glu Leu Met Arg Phe 

5 

<210> 83 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 492-500 of the TADG-15 protein 
<400> 83 

His Gin Phe Thr Cys Lys Asn Lys Phe 

5 

<210> 84 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

SEQ-34 



<223> Residues 53-61 of the TADG-15 protein 
<400> 84 

Gly Arg Trp Val Val Leu Ala Ala Val 

5 

<210> 85 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 248-256 of the TADG-15 protein 
<400> 85 

Leu Arg Gly Asp Ala Asp Ser Val Leu 

5 

<210> 86 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 572-580 of the TADG-15 protein 
<400> 86 

Tyr Arg Cys Leu Asn Gly Leu Cys Leu 

5 

<210> 87 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 692-700 of the TADG-15 protein 
<400> 87 

Arg Arg Leu Lys Arg lie lie Ser His 

5 



<210> 88 



SEQ-35 



<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 24-32 of the TADG-15 protein 
<400> 88 

Ser Arg His Glu Lys Val Asn Gly Leu 

5 

<210> 89 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 147-155 of the TADG-15 protein 
<400> 89 

Ser Glu Gly Ser Val lie Ala Tyr Tyr 

5 

<210> 90 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 715-723 of the TADG-15 protein 
<400> 90 

Leu Glu Leu Glu Lys Pro Ala Glu Tyr 

5 

<210> 91 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 105-113 of the TADG-15 protein 
<400> 91 

SEQ-36 



Tyr Glu Asn Ser Asn Ser Thr Glu Phe 

5 

<210> 92 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 14-22 of the TADG-15 protein 
<400> 92 

Lys Asp Phe Gly Ala Gly Leu Lys Tyr 

5 

<210> 93 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 129-137 of the TADG-15 protein 
<400> 93 

Ser Gly Val Pro Phe Leu Gly Pro Tyr 

5 

<210> 94 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 436-444 of the TADG-15 protein 
<400> 94 

Thr Asp Thr Gly Phe Leu Ala Glu Tyr 

5 

<210> 95 
<211> 9 
<212> PRT 

SEQ-37 



<213> Homo sapiens 
<220> 

<223> Residues 766-774 of the TADG-15 protein 
<400> 95 

Gly Glu lie Arg Val lie Asn Gin Thr 

5 

<210> 96 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 402-410 of the TADG-15 protein 
<400> 96 

Val Glu lie Asn Gly Glu Lys Tyr Cys 

5 

<210> 97 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 482-490 of the TADG-15 protein 
<400> 97 

Asp Glu Leu Asn Cys Ser Cys Asp Ala 

5 

<210> 98 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> Residues 82-90 of the TADG-15 protein 
<400> 98 

Arg Asp Val Arg Val Gin Lys Val Phe 

5 

SEQ-38 
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